About
IAAS platform
Activity
-
Had a fantastic time at the Ray Meetup hosted by ByteDance in San Jose! It was incredible to see the community come together and dive deep into AI…
Had a fantastic time at the Ray Meetup hosted by ByteDance in San Jose! It was incredible to see the community come together and dive deep into AI…
Liked by Tony Zhang
-
I’m excited to join Databricks as a Staff Software Engineer, working on LLM inference performance. Especially as the capability frontier builds upon…
I’m excited to join Databricks as a Staff Software Engineer, working on LLM inference performance. Especially as the capability frontier builds upon…
Liked by Tony Zhang
-
It is glad to share that our paper "Scaler: Efficient and Effective Cross Flow Analysis" has been accepted to ASE'2024. Scaler develops a…
It is glad to share that our paper "Scaler: Efficient and Effective Cross Flow Analysis" has been accepted to ASE'2024. Scaler develops a…
Liked by Tony Zhang
Experience
Education
Volunteer Experience
-
Student Helper of Education Info Day
The Hong Kong Polytechnic University
- Present 14 years 10 months
Education
-
Volunteer of Flag Selling Day
Hong Kong Rehabilitation Power
- Present 15 years 8 months
Poverty Alleviation
Publications
-
Deterministic Crash Recovery for NAND Flash based Storage Systems
51st Design Automation Conference, San Francisco, CA
Courses
-
Advanced Storage Systems
15746
-
Big Data Studio
15648
-
Cloud Computing
15619
-
Database Applications
15615
-
Distributed Systems
15640
-
Machine Learning
10601
-
Multimedia Database and Data Mining
15648
Projects
-
Auto-provisioning Suggestion System for Spark
- Present
Designed and Implemented an auto-provisioning suggestion system to provide an appropriate number and type of AWS resources to meet both performance expectations and budget constraints.
• Profiled metrics for different iterative machine learning and data mining applications on Spark.
• Estimated performance and budget results based on the profiling metrics and AWS EC2 instance resources.
• Conducted comparisons and provided suggestions based on the estimation of performance and budget…Designed and Implemented an auto-provisioning suggestion system to provide an appropriate number and type of AWS resources to meet both performance expectations and budget constraints.
• Profiled metrics for different iterative machine learning and data mining applications on Spark.
• Estimated performance and budget results based on the profiling metrics and AWS EC2 instance resources.
• Conducted comparisons and provided suggestions based on the estimation of performance and budget results. -
MapReduce Engine
Designed and Implemented a MapReduce Facility similar to Hadoop capable of dispatching parallel map and reduce processes across multiple hosts, as well as recovering from worker failure.
• Designed and implemented the basic MapReduce framework, including master and Slave, map and reduce, fault tolerance, scheduling and concurrency modules.
• Designed a pseudo-random data distribution function to look up address of slave storage devices without relying on a central metadata server. -
Twitter Analytics Web Service on AWS
Designed, developed and deployed a web service for Twitter data analysis that met the throughput, budget and query requirements of the client.
• Extracted, Transformed and Loaded process for six different kinds of client queries by extracting JSON Twitter dataset, transforming dataset into a specific format and loading the data into HBase/MySQL on AWS.
• Designed and built backend HBase and MySQL using AWS resources. Utilized MySQL Cluster and Elastic Load Balancer to meet the throughput…Designed, developed and deployed a web service for Twitter data analysis that met the throughput, budget and query requirements of the client.
• Extracted, Transformed and Loaded process for six different kinds of client queries by extracting JSON Twitter dataset, transforming dataset into a specific format and loading the data into HBase/MySQL on AWS.
• Designed and built backend HBase and MySQL using AWS resources. Utilized MySQL Cluster and Elastic Load Balancer to meet the throughput and budget requirement. -
Cloud File System
-
Built a Cloud File System using file-system in user space (FUSE) based on different storage devices of SSD as well as Amazon s3 storage systems.
• Designed and implemented a hybrid Fuse file system to leverage the properties of SSDs and cloud storage by placing small data objects on SSD for fast retrieval and big data objects on the cloud storage.
• Designed and implemented block level deduplication of objects on the cloud storage to save space and network transfers, and a Cloud FS cache…Built a Cloud File System using file-system in user space (FUSE) based on different storage devices of SSD as well as Amazon s3 storage systems.
• Designed and implemented a hybrid Fuse file system to leverage the properties of SSDs and cloud storage by placing small data objects on SSD for fast retrieval and big data objects on the cloud storage.
• Designed and implemented block level deduplication of objects on the cloud storage to save space and network transfers, and a Cloud FS cache to improve the read/write operation efficiency by accessing data from cache rather than cloud storage systems
• Designed and implemented some snapshot related functions to benefit the consistency and recovery of file system, including create, delete, restore and list multiple snapshots
Honors & Awards
-
Outstanding Student in Intellectual Development of Hong Kong Polytechnic University
Hong Kong Polytechnic University
-
zEnterprise Contest 2012 1st Runner-up
IBM China/ Hong Kong Limited, Hong Kong
-
HKSAR Government Scholarship Fund 2012/13 – Talent Development Scholarship
Hong Kong Special Administration Region Government
-
HKSAR Government Scholarship in Economics 2012/13
Hong Kong Special Administration Region government
-
COMP Student of the Year with Outstanding Academic Performance 2011/12
Hong Kong Polytechnic University
COMP stands for Department of Computing, Hong Kong Polytechnic University.
-
COMP Scholarship for Non-local (Chinese mainland) Students
Department of Computing, Hong Kong Polytechnic University
This scholarship lasts for five years which covers all tuition fee and living expenses.
Languages
-
Chinese
Native or bilingual proficiency
-
English
Professional working proficiency
More activity by Tony
-
Join us for the vLLM & NVIDIA Triton User Meetup. Dive into the latest in #AI with expert talks, updates, and networking. Secure your spot now ➡️…
Join us for the vLLM & NVIDIA Triton User Meetup. Dive into the latest in #AI with expert talks, updates, and networking. Secure your spot now ➡️…
Liked by Tony Zhang
-
Our latest work unveils Bytedance's checkpoint format for the LLM training for multi-framework (Megatron, FSDP, etc), efficiency optimizations for…
Our latest work unveils Bytedance's checkpoint format for the LLM training for multi-framework (Megatron, FSDP, etc), efficiency optimizations for…
Liked by Tony Zhang
-
It is my pleasure to be invited to give a talk at 7th IEEE International Conference on Multimedia Information Processing and Retrieval, invited by…
It is my pleasure to be invited to give a talk at 7th IEEE International Conference on Multimedia Information Processing and Retrieval, invited by…
Liked by Tony Zhang
-
Join us at the 5th vLLM meetup hosted with AWS!
Join us at the 5th vLLM meetup hosted with AWS!
Liked by Tony Zhang
-
This week, ByteDance US Infrastructure lab will invite Zhaozhuo Xu, an assistant professor from Stevens Institute of Technology, to talk about…
This week, ByteDance US Infrastructure lab will invite Zhaozhuo Xu, an assistant professor from Stevens Institute of Technology, to talk about…
Liked by Tony Zhang
-
I'm excited to announce FireAttention v2: a breakthrough for long context inference. At Fireworks AI, we challenge ourselves to deliver high impact…
I'm excited to announce FireAttention v2: a breakthrough for long context inference. At Fireworks AI, we challenge ourselves to deliver high impact…
Liked by Tony Zhang
-
49 Engineering blogs worth reading to improve your system design: Engineering at Meta - https://lnkd.in/e8tiSkEv Google Research -…
49 Engineering blogs worth reading to improve your system design: Engineering at Meta - https://lnkd.in/e8tiSkEv Google Research -…
Liked by Tony Zhang
-
Excited to present our journey in building a fully managed stream processing platform on Flink at scale for LinkedIn at #FlinkForward San Francisco…
Excited to present our journey in building a fully managed stream processing platform on Flink at scale for LinkedIn at #FlinkForward San Francisco…
Liked by Tony Zhang
-
【Info Seminar – 2022/23 MSc in Blockchain Technology】 #PolyUCOMP is going to launch the NEW MSc Programme, and it is the FIRST blockchain master…
【Info Seminar – 2022/23 MSc in Blockchain Technology】 #PolyUCOMP is going to launch the NEW MSc Programme, and it is the FIRST blockchain master…
Liked by Tony Zhang
-
Looking for a Tech Lead Manager to join our infrastructure team! https://lnkd.in/gPDGUGY
Looking for a Tech Lead Manager to join our infrastructure team! https://lnkd.in/gPDGUGY
Liked by Tony Zhang
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Tony Zhang in United States
-
Tony Zhang
-
Tony Z.
Sr. Electrical Engineer Google | Power Electronics Ph.D. MIT
-
Tony Zhang
-
Tony Zhang
-
Tony Zhang
407 others named Tony Zhang in United States are on LinkedIn
See others named Tony Zhang