Adam Dziedzic

Adam Dziedzic

Toronto, Ontario, Canada
2K followers 500+ connections

About

Researcher, Engineer, Software Developer, Systems Designer.

My motto: Design and…

Articles by Adam

Activity

Join now to see all activity

Experience

  • CISPA Helmholtz Center for Information Security Graphic
  • -

    Toronto, Ontario, Canada

  • -

    Toronto, Ontario, Canada

  • -

    Greater Chicago Area

  • -

    main: Madison, Wisconsin, intro week: New York, PIRC conference: Mountain View

  • -

    Redmond

  • -

    Lausanne Area, Switzerland

  • -

    London

  • -

    Geneva Area, Switzerland

  • -

    Warsaw, Masovian District, Poland

  • -

    Warsaw, Masovian District, Poland

  • -

    Warsaw, Masovian District, Poland

Education

  • University of Chicago Graphic
  • -

    -

    Computer systems architecture, software engineering and information systems.

    Dissertation titled: "Analysis and comparison of NoSQL databases with an example of application using CouchDB."

  • -

    -

  • -

    -

Licenses & Certifications

Publications

  • CaPC Learning: Confidential and Private Collaborative Learning

    ICLR 2021

    Machine learning benefits from large training datasets, which may not always be possible to collect by any single entity, especially when using privacy-sensitive data. In many contexts, such as healthcare and finance, separate parties may wish to collaborate and learn from each other's data but are prevented from doing so due to privacy regulations. Some regulations prevent explicit sharing of data between parties by joining datasets in a central location (confidentiality). Others also limit…

    Machine learning benefits from large training datasets, which may not always be possible to collect by any single entity, especially when using privacy-sensitive data. In many contexts, such as healthcare and finance, separate parties may wish to collaborate and learn from each other's data but are prevented from doing so due to privacy regulations. Some regulations prevent explicit sharing of data between parties by joining datasets in a central location (confidentiality). Others also limit implicit sharing of data, e.g., through model predictions (privacy). There is currently no method that enables machine learning in such a setting, where both confidentiality and privacy need to be preserved, to prevent both explicit and implicit sharing of data. Federated learning only provides confidentiality, not privacy, since gradients shared still contain private information. Differentially private learning assumes unreasonably large datasets. Furthermore, both of these learning paradigms produce a central model whose architecture was previously agreed upon by all parties rather than enabling collaborative learning where each party learns and improves their own local model. We introduce Confidential and Private Collaborative (CaPC) learning, the first method provably achieving both confidentiality and privacy in a collaborative setting. We leverage secure multi-party computation (MPC), homomorphic encryption (HE), and other techniques in combination with privately aggregated teacher models. We demonstrate how CaPC allows participants to collaborate without having to explicitly join their training sets or train a central model. Each party is able to improve the accuracy and fairness of their model, even in settings where each party has a model that performs well on their own dataset or when datasets are not IID and model architectures are heterogeneous across parties.

    See publication
  • Pretrained Transformers Improve Out-of-Distribution Robustness

    ACL 2020

    Although pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by constructing a new robustness benchmark with realistic distribution shifts. We measure the generalization of previous models including bag-of-words models, ConvNets, and LSTMs, and we show that pretrained Transformers' performance declines are substantially smaller…

    Although pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by constructing a new robustness benchmark with realistic distribution shifts. We measure the generalization of previous models including bag-of-words models, ConvNets, and LSTMs, and we show that pretrained Transformers' performance declines are substantially smaller. Pretrained transformers are also more effective at detecting anomalous or OOD examples, while many previous models are frequently worse than chance. We examine which factors affect robustness, finding that larger models are not necessarily more robust, distillation can be harmful, and more diverse pretraining data can enhance robustness. Finally, we show where future work can improve OOD robustness.

    See publication
  • Band-limited Training and Inference for Convolutional Neural Networks

    ICML 2019

    The convolutional layers are core building blocks of neural network architectures. In general, a convolutional filter applies to the entire frequency spectrum of the input data. We explore artificially constraining the frequency spectra of these filters and data, called band-limiting, during training. The frequency domain constraints apply to both the feed-forward and back-propagation steps. Experimentally, we observe that Convolutional Neural Networks (CNNs) are resilient to this compression…

    The convolutional layers are core building blocks of neural network architectures. In general, a convolutional filter applies to the entire frequency spectrum of the input data. We explore artificially constraining the frequency spectra of these filters and data, called band-limiting, during training. The frequency domain constraints apply to both the feed-forward and back-propagation steps. Experimentally, we observe that Convolutional Neural Networks (CNNs) are resilient to this compression scheme and results suggest that CNNs learn to leverage lower-frequency components. In particular, we found: (1) band-limited training can effectively control the resource usage (GPU and memory); (2) models trained with band-limited layers retain high prediction accuracy; and (3) requires no modification to existing training algorithms or neural network architectures to use unlike other compression schemes.

    See publication
  • Columnstore and B+ tree - Are Hybrid Physical Designs Important?

    SIGMOD 2018

    Commercial DBMSs, such as Microsoft SQL Server, cater to diverse workloads including transaction processing, decision support, and operational analytics. They also support variety in physical design structures such as B+ tree and columnstore. The benefits of B+tree for OLTP workloads and columnstore for decision support workloads are well-understood. However, the importance of hybrid physical designs, consisting of both columnstore and B+ tree indexes on the same database, is not well-studied -…

    Commercial DBMSs, such as Microsoft SQL Server, cater to diverse workloads including transaction processing, decision support, and operational analytics. They also support variety in physical design structures such as B+ tree and columnstore. The benefits of B+tree for OLTP workloads and columnstore for decision support workloads are well-understood. However, the importance of hybrid physical designs, consisting of both columnstore and B+ tree indexes on the same database, is not well-studied - a focus of this paper. We first quantify the trade-offs using carefully-crafted micro-benchmarks. This micro-benchmarking indicates that hybrid physical designs can result in orders of magnitude better performance depending on the workload. For complex real-world applications, choosing an appropriate combination of columnstore and B+ tree indexes for a database workload is challenging. We extend the Database Engine Tuning Advisor for Microsoft SQL Server to recommend a suitable combination of B+ tree and columnstore indexes for a given workload. Through extensive experiments using industry-standard benchmarks and several real-world customer workloads, we quantify how a physical design tool capable of recommending hybrid physical designs can result in orders of magnitude better execution costs compared to approaches that rely either on columnstore-only or B+ tree-only designs

    See publication

Courses

  • Advanced C++ Programming

    -

  • Advanced Databases (Spatial, temporal, spatio-temporal and multimedia databases)

    02288

  • Advanced Distributed Systems

    CMSC 23310 01

  • Advanced Operating Systems

    CMSC 33100 01

  • Algorithms

    CMSC 37000 01

  • Analysis and Design of Information Systems

    -

  • Analysis of Algorithms

    -

  • Applied statistics and statistical software

    02441

  • Computer Architecture

    CMSC 32200 1

  • Convex Optimization

    BUSF 36903 / STAT 31015

  • Data Bases 2

    -

  • Data Mining

    -

  • Discrete Mathematics

    CMSC 37110 01

  • Introduction to Databases

    CMSC 33550 01

  • Introduction to Mobile Applications Development

    -

  • Introduction to Statistical Machine Learning

    TTIC 31020 1

  • Java Programming

    02115

  • Knowledge Discovery Methods (Machine learning)

    -

  • Knowledge Engineering

    -

  • Logical Systems and Logic Programming

    02156

  • Mathematical Foundations of Machine Learning

    CMSC 35300 2

  • Mathematical Toolkit

    TTIC 31150-1 / CMSC 31150

  • Modern Heuristic Techniques

    -

  • Oracle System Architecture and Data Base Administration

    -

  • Python: Advanced Hands-On (32h)

    -

  • SNN (Neural Nets)

    -

  • Software Engineering

    -

  • Speech Technologies

    CMSC 35110 1

  • Team Building

    -

  • Theoretical Basics of Cryptography

    -

  • Topics in Databases

    CMCS 33501

  • Web 2.0 and mobile interactions

    02815

Projects

  • Soundpie - your music in restaurants & at parties

    Well, a typical party. The current song terminates, yet parties hate silence, so guests try to play their cherished songs from the Internet. That forms a typical "YouTube queue" which we'd like to eschew by using our product. What's more, we'd like to provide you with your favourite music in every restaurant that you happen to enter.

    Our fully functional application prototype was presented at Startup Sauna Warsaw and qualified for Warsaw final round. It integrated last.fm, OpenID…

    Well, a typical party. The current song terminates, yet parties hate silence, so guests try to play their cherished songs from the Internet. That forms a typical "YouTube queue" which we'd like to eschew by using our product. What's more, we'd like to provide you with your favourite music in every restaurant that you happen to enter.

    Our fully functional application prototype was presented at Startup Sauna Warsaw and qualified for Warsaw final round. It integrated last.fm, OpenID, Youtube and Google Maps services through their API-s, was running on Android devices (clients) as well as on Google App Engine (server).

    Other creators
  • Work Placer: a Mobile Web 2.0 Application for DTU students

    - Present

    We designed a smartphone DTU 2.0 widget which integrated the Web 2.0 design patterns and applied aspects of augmented reality.

    Other creators
  • Document management system

    -

    Document management system as an application in three-tiered architecture created in the framework of my bachelor of science dissertation.

  • People Placer: a Mobile Web 2.0 Application for DTU students

    -

    We designed and partially implemented a smartphone widget that simply finds people. It takes the advantage of a university campus and is fully integrated with the context - in this case with DTU campus. To facilitate collective use we had to devise external web service and achieved this by using Google App Engine. It incorporates declarative living and synchronized web concepts. Technology used: HTML, CSS, JavaScript, Google Maps API, iCalendar interface, Facebook API, Python, Google App Engine

    Other creators
  • Music Picker: a Mobile Web 2.0 Application for DTU students

    -

    We designed a smartphone widget which provides aspects of music social network interaction, helps a potential user mine information on listening habits, discover new artists, declare participation in upcoming concerts and even gives them an intelligent agent to adjust music to their daily activities. Technology used: HTML, CSS, JavaScript, Google Maps API, last.fm API.

    Other creators

Honors & Awards

  • Scholarship for the best faculty students in 2011

    Warsaw Univerity of Technology

    Based on grades

  • Scholarship for the best faculty students in 2010

    Warsaw Univeristy of Technology

    Based on grades

  • Scholarship for the best faculty students in 2009

    Warsaw University of Technology

    Based on grades

  • Scholarship for the best faculty students in 2008

    Warsaw University of Technology

    Based on grades

Languages

  • English

    Professional working proficiency

  • Polish

    Native or bilingual proficiency

  • French

    Elementary proficiency

More activity by Adam

View Adam’s full profile

  • See who you know in common
  • Get introduced
  • Contact Adam directly
Join to view full profile

Other similar profiles

Explore collaborative articles

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

Explore More

Add new skills with these courses