Personal Information
Organization / Workplace
Greater Seattle Area United States
Occupation
Principal Engineering Manager at Microsoft
Industry
Technology / Software / Internet
About
I lead the Streaming Platform team in the Application and Service Shared Data group, where I am responsible for building the near-real-time (NRT) streaming platform at Microsoft. Since joining Microsoft in 2007, I have been spearheading the effort to build large-scale distributed systems for NRT data ingestion and processing, leveraging open-source technologies including Apache Kafka, Spark, and Elasticsearch. Big data fascinates me, and I am intrigued by the ability to process data at scale. I am also the founder and organizer of the Seattle Apache Kafka Meetup.
Tags
kafka
distributed log ingestion
siphon
kafka seattle meetup
kafka meetup expedia seattle
seattle kafka meetup avvo
netflix
large scale
kafka multi tenancy in cloud
seattle meetup
exactly once semantic
eventhub
messaging system
kafka connect
tuning
performance
optimization
mirrormaker
mirus
data collection
datawarehouse
kafka replication
machine learning
deep learning
Presentations (17)