Big Data Engineer Resume Overview

Vishwa has over 7 years of experience as a big data engineer with expertise in technologies like Apache Kafka, Spark, Hadoop, Hive, Cassandra, and Python. He has worked on building streaming applications, data pipelines, and data integration projects for clients in various industries. His technical skills include Apache Spark, Scala, Python, data warehousing, machine learning, and containerization tools like Docker.


Vishwa | Sr. Big Data Engineer | vishwas.bigdata@gmail.com | Phone: 469-567-0045

Overall 7+ years of experience in software application development, including analysis, design, development, integration, testing, and maintenance of various big data applications using Scala and Python. Experienced in developing big data applications on cloud and on-premises platforms.

Technical Summary

• Experienced in building streaming applications using Apache Kafka, Spark Streaming, and other streaming platforms (a minimal sketch follows this list).
• Experienced in building highly scalable big data solutions on Hadoop across multiple distributions (Cloudera and Hortonworks) and NoSQL platforms (HBase and Cassandra).
• Expertise in big data architecture with the Hadoop file system and its ecosystem tools: MapReduce, HBase, Hive, Pig, ZooKeeper, Oozie, Flume, Avro, Impala, Apache Spark, Spark Streaming, and Spark SQL.
• Hands-on experience in Apache Sqoop, Apache Storm, and Apache Hive integration.
• Experience with multi-cloud environments, including Azure and Amazon Web Services (AWS).
• Experience with different file formats such as Parquet, JSON, Avro, and ORC for Hive querying and processing.
• Developed Spark applications using Scala and Python for a wide range of ETL operations and machine learning algorithms.
• Experience in building end-to-end continuous integration and deployment pipelines using Jenkins.
• Familiarity with containerization and virtualization tools such as Docker and Kubernetes.
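
As a minimal illustration of the Kafka plus Spark streaming work listed above, the sketch below reads a Kafka topic with Spark Structured Streaming in PySpark. The broker address and topic name are placeholders, and a production job would write to HDFS or Hive rather than the console.

from pyspark.sql import SparkSession
from pyspark.sql.functions import col

# Requires the spark-sql-kafka connector on the classpath.
spark = (SparkSession.builder
         .appName("kafka-streaming-sketch")
         .getOrCreate())

# Read the raw Kafka stream; key and value arrive as binary columns.
events = (spark.readStream
          .format("kafka")
          .option("kafka.bootstrap.servers", "broker:9092")  # placeholder broker
          .option("subscribe", "events")                     # placeholder topic
          .option("startingOffsets", "latest")
          .load()
          .select(col("key").cast("string"), col("value").cast("string")))

# Console sink for demonstration; a real job would write to HDFS, Hive, or a database.
query = (events.writeStream
         .outputMode("append")
         .format("console")
         .option("truncate", "false")
         .start())

query.awaitTermination()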

Experience Summary

Client: Entegris Inc | Designation: Sr. Data Engineer

Location: Minneapolis, MN | Duration: Feb 21 – Present

• Responsible for the digital transformation from the legacy BW system to a cloud data warehouse.
• Built out use cases for data that had been maintained manually in different manufacturing data warehouses.
• Core member of the Cloud Analytics Selection Team, evaluating cloud platforms for suitability across many of the organization's use cases.
• Created Dataproc and Dataflow clusters on GCP for running computations in the cloud.
• Developed CI/CD pipelines using Python, Spark, and Spark SQL for data extraction, transformation, pivoting, and aggregation into the formats specified by business requirements.
• Loaded data into BigQuery incrementally every 15 minutes using Google Dataproc, PySpark, gsutil, and shell scripts.
• Used Google Cloud Composer (Airflow) to automate data pipelines from Cloud Storage to BigQuery (see the sketch after this list).
• Created queries on BigQuery over different datasets and integrated them with Power BI for dashboarding.
• Used REST APIs with Python to ingest data from external sites into BigQuery.
• Created dashboards in Power BI for data visualization and for quarterly and monthly reporting.
• Worked on a POC for integrating Snowflake with AWS for one of our data use cases: built a demo pipeline using an AWS S3 bucket, Glue, and Python transformations, and ingested the curated dataset into Snowflake.
• Worked on a POC with AWS SageMaker for one of our ML/AI use cases and with Vertex AI on GCP to compare the two platforms.
• Monitored BigQuery, Dataproc, and Cloud Dataflow jobs via the GCP monitoring agent.
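
As a hedged illustration of the Cloud Composer/Airflow automation from Cloud Storage to BigQuery mentioned above, the sketch below defines a DAG around the GCSToBigQueryOperator; the bucket, object prefix, dataset, table, and schedule are placeholder values, not details from this engagement.

from datetime import datetime

from airflow import DAG
from airflow.providers.google.cloud.transfers.gcs_to_bigquery import GCSToBigQueryOperator

with DAG(
    dag_id="gcs_to_bigquery_incremental",          # hypothetical DAG name
    start_date=datetime(2021, 2, 1),
    schedule_interval="*/15 * * * *",              # every 15 minutes, matching the load cadence above
    catchup=False,
) as dag:
    load_to_bq = GCSToBigQueryOperator(
        task_id="load_landing_files",
        bucket="example-landing-bucket",                       # placeholder bucket
        source_objects=["manufacturing/*.parquet"],            # placeholder prefix
        destination_project_dataset_table="example-project.analytics.manufacturing_facts",
        source_format="PARQUET",
        write_disposition="WRITE_APPEND",                      # append each incremental batch
        autodetect=True,
    )
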
Client: Target Corporation Inc | Designation: Sr. Data Engineer

Location: Minneapolis, MN | Duration: Dec 19 – Present

• Developed external vendor file export pipelines using Spark, Hive, Python, Scala, and shell scripting.
• Implemented optimized Spark/Scala data pipelines for aggregating large volumes of data.
• Worked on generating business reports from a custom vendor data platform.
• Developed integration pipelines for SFTP and cloud storage such as S3 and GCS.
• Developed Spark applications using Spark SQL for data extraction, transformation, and aggregation into specified formats, analyzing the data to uncover insights in customer-requested formats.
• Experienced in SQL, data transformations, statistical analysis, and troubleshooting across multiple database platforms (MySQL, PostgreSQL, Teradata, and Azure SQL Data Warehouse).
• Migrated existing data pipelines from Hortonworks Data Platform 2 to Hortonworks Data Platform 3.
• Implemented data pipeline automation using Oozie and internal open-source tools such as an automation portal.
• Implemented a reporting layer on top of Apache Druid for incremental updates to business reports.
• Built optimized Hive queries over large volumes of data in different data formats.
• Developed a continuous deployment process using container-based tools such as Drone.
• Implemented Docker pipelines for testing and validation in the integration and deployment process.
• Developed end-to-end unit and integration tests for data pipelines using PySpark (see the sketch after this list).
• Developed daily metrics pipelines and exposed them through a Grafana dashboard with alerting.
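
The following is a minimal sketch of the PySpark unit-testing approach referenced above, using pytest with a local SparkSession; the transformation under test and all table and column names are stand-ins, not code from the actual pipelines.

import pytest
from pyspark.sql import SparkSession
from pyspark.sql import functions as F


@pytest.fixture(scope="session")
def spark():
    # Local SparkSession shared across the test session.
    return SparkSession.builder.master("local[2]").appName("pipeline-tests").getOrCreate()


def aggregate_by_vendor(df):
    # Stand-in for a pipeline transformation: total quantity per vendor.
    return df.groupBy("vendor_id").agg(F.sum("quantity").alias("total_quantity"))


def test_aggregate_by_vendor(spark):
    source = spark.createDataFrame(
        [("v1", 2), ("v1", 3), ("v2", 5)],
        ["vendor_id", "quantity"],
    )
    result = {row["vendor_id"]: row["total_quantity"]
              for row in aggregate_by_vendor(source).collect()}
    assert result == {"v1": 5, "v2": 5}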

Client: Samsung Electronics America | Designation: Sr. Data Engineer

Location: Plano, TX | Duration: Mar 18 – Dec 19

• Implemented real-time data pipelines for streaming analytics using Kafka and Spark Streaming with Scala.
• Worked on migrating on-premises cluster data into Azure to implement real-time features.
• Created custom dashboards using Application Insights and the Application Insights query language, processing metrics sent to Application Insights and building dashboards on top of them in Azure.
• Created real-time streaming dashboards in Power BI using Stream Analytics to push datasets to Power BI.
• Developed a custom message consumer to consume data from the Kafka producer and push the messages to Service Bus and Event Hub (Azure components); a sketch follows this list.
• Implemented Spark ETL jobs in Azure HDInsight for ETL operations in the cloud.
• Implemented CI/CD pipelines to build and deploy projects in the Hadoop environment using Jenkins.
• Implemented a data platform in a Hive data warehouse for on-premises use and archival purposes.
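
A sketch of a consumer along the lines of the one described above, reading from Kafka with kafka-python and forwarding each record to an Azure Event Hub with azure-eventhub; the topic, broker, connection string, and hub name are placeholders.

from kafka import KafkaConsumer                          # kafka-python
from azure.eventhub import EventData, EventHubProducerClient

consumer = KafkaConsumer(
    "device-telemetry",                                  # placeholder topic
    bootstrap_servers=["broker:9092"],                   # placeholder broker
    group_id="eventhub-forwarder",
    value_deserializer=lambda v: v.decode("utf-8"),
)

producer = EventHubProducerClient.from_connection_string(
    conn_str="<EVENT_HUB_CONNECTION_STRING>",            # placeholder connection string
    eventhub_name="telemetry",                           # placeholder hub name
)

# Forward each Kafka record to the Event Hub as a single-event batch.
for record in consumer:
    batch = producer.create_batch()
    batch.add(EventData(record.value))
    producer.send_batch(batch)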

Client: United Airlines | Designation: Sr. Big Data Engineer

Location: Chicago, IL | Duration: Oct 16 – Feb 18

• Extracted data from Teradata and MySQL into HDFS using Sqoop export/import.
• Developed Sqoop jobs with incremental loads to populate Hive external tables.
• Used design patterns in MapReduce to convert business data into custom formats.
• Handled different compression codecs such as LZO, GZIP, and Snappy.
• Optimized Hive performance using partitioning and bucketing (see the sketch after this list).
• Worked with Hive dynamic partitions to overcome the Hive locking mechanism.
• Developed UDFs in Java as needed for use in Hive queries.
• Developed crontab entries for scheduling and orchestrating the ETL process.
• Involved in indexing Hive data using Solr and preparing custom tokenizer formats for querying.
• Involved in designing a real-time computation engine using Kafka.
• Worked on a POC to stream data into Solr with Spark Streaming and perform indexing on it.
• Wrote build jobs using Maven and integrated them with Jenkins.
• Ingested third-party data from AWS cloud buckets.
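
As an illustration of the partitioning and bucketing optimizations noted above, the sketch below writes a partitioned, bucketed Hive table with the PySpark DataFrame API; the table and column names are illustrative, not taken from the actual project.

from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("hive-partition-bucket-sketch")
         .enableHiveSupport()
         .getOrCreate())

flights = spark.table("staging.flights")                 # placeholder source table

# Partition by date so queries prune whole directories, and bucket on a
# high-cardinality join key so bucketed joins avoid a full shuffle.
(flights.write
 .partitionBy("flight_date")
 .bucketBy(32, "tail_number")
 .sortBy("tail_number")
 .mode("overwrite")
 .saveAsTable("analytics.flights_optimized"))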

Client: BCBS | Designation: Big Data Engineer

Location: Baltimore, MD | Duration: Jul 15 – Sep 16

• Developed Oozie automations using custom MapReduce, Pig, Hive, and Sqoop.
• Built reusable Hive UDF libraries for the business, enabling users to reuse them across queries.
• Performed performance tuning on Hive queries, joins, and configuration parameters to improve query response time.
• Created partitions and buckets based on state for further processing with bucketed Hive joins.
• Used Cassandra CQL with the Java API to retrieve data from Cassandra tables (an equivalent Python sketch follows this list).
• Developed applications on Spark as part of the next-generation platform implementation.
• Implemented real-time data ingestion using Kafka.
• Developed a data pipeline using Kafka and Storm to store data in HDFS.
• Used Apache Maven extensively while developing MapReduce programs.
• Worked extensively on Pig scripts and Pig UDFs to perform ETL activities.
• Developed Spark scripts using Python.
• Developed workflows in Oozie to automate tasks.
• Collected log data from web servers and loaded it into HDFS using Flume.
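
The Cassandra lookups above used the Java driver; purely as an illustration, the sketch below performs an equivalent keyed read with the Python cassandra-driver, with the contact point, keyspace, table, and column names all placeholders.

from cassandra.cluster import Cluster

cluster = Cluster(["cassandra-host"])                    # placeholder contact point
session = cluster.connect("claims_keyspace")             # placeholder keyspace

# Prepared statement for a keyed lookup against a placeholder table.
lookup = session.prepare(
    "SELECT member_id, claim_total FROM member_claims WHERE member_id = ?"
)
row = session.execute(lookup, ["M12345"]).one()
if row is not None:
    print(row.member_id, row.claim_total)

cluster.shutdown()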

Client: Health Integrated | Designation: Hadoop Developer

Location: Tampa, FL | Duration: Nov 14 – Jun 15

• Gathered the exact reporting requirements from the business groups and users.
• Imported trading and derivatives data into the Hadoop Distributed File System using the ecosystem components MapReduce, Pig, Hive, and Sqoop.
• Responsible for writing Hive queries and Pig scripts for data processing.
• Ran Sqoop to import data from Oracle and other databases.
• Created shell scripts to collect raw logs from different machines.
• Created Hive tables with static and dynamic partitions (see the sketch after this list).
• Optimized scripts using ILLUSTRATE and EXPLAIN and used parameterized Pig scripts.
• Defined Pig UDFs for functions such as swap, hedging, speculation, and arbitrage.
• Processed unstructured log files using MapReduce programs.
• Imported and exported data between HDFS and Hive using Sqoop.
• Involved in configuring HA, resolving Kerberos security issues, and performing NameNode failure restoration activities from time to time as part of maintaining zero downtime.
• Developed JUnit test cases for application unit testing.
• Used SVN as version control to check in code, create branches, and tag releases.
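
The dynamic-partition loads above were done directly in Hive; the sketch below shows the equivalent statements issued through PySpark's Hive support, with table and column names chosen only for illustration.

from pyspark.sql import SparkSession

spark = (SparkSession.builder
         .appName("hive-dynamic-partition-sketch")
         .enableHiveSupport()
         .getOrCreate())

# Allow fully dynamic partitions for the insert below.
spark.sql("SET hive.exec.dynamic.partition=true")
spark.sql("SET hive.exec.dynamic.partition.mode=nonstrict")

# The partition column (trade_date) comes last in the SELECT, so each row
# lands in the partition derived from its own value.
spark.sql("""
    INSERT OVERWRITE TABLE analytics.trades PARTITION (trade_date)
    SELECT trade_id, instrument, notional, trade_date
    FROM staging.raw_trades
""")
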
Client: AdvanSoft International | Designation: Java/J2EE Developer

Location: India | Duration: Jun 13 – May 14

• Developed modules based on the Struts MVC architecture.
• Developed business components using core Java concepts and classes, including inheritance, polymorphism, collections, serialization, and multithreading.
• Developed the web interface using Servlets, JavaServer Pages, HTML, and CSS.
• Developed DAO objects using JDBC.
• Used the Spring Framework for dependency injection and integrated it with the Struts Framework and Hibernate.
• Used Log4j to capture logs, including runtime exceptions; monitored error logs and fixed the problems.
• Performed unit testing, system testing, and integration testing.
• Provided technical support for production environments, resolving issues, analyzing defects, and providing and implementing fixes.
