About
Researcher, Engineer, Software Developer, Systems Designer.
My motto: Design and…
Articles by Adam
Activity
-
🧑🏽🎓 Applications Open: Interdisciplinary Research Incubator ❗ Find out more and apply till Oct 29th: https://lnkd.in/dG4WBDva ❗ AI is…
🧑🏽🎓 Applications Open: Interdisciplinary Research Incubator ❗ Find out more and apply till Oct 29th: https://lnkd.in/dG4WBDva ❗ AI is…
Liked by Adam Dziedzic
-
CISPA Innovation Campus to be built in Stadt St. Ingbert – with private funding CISPA and the city of St. Ingbert have further developed their plans…
CISPA Innovation Campus to be built in Stadt St. Ingbert – with private funding CISPA and the city of St. Ingbert have further developed their plans…
Liked by Adam Dziedzic
-
A very pleasant and constructive exchange with Peter Tschentscher, First Mayor of the Free and Hanseatic City of Hamburg, about the role of research…
A very pleasant and constructive exchange with Peter Tschentscher, First Mayor of the Free and Hanseatic City of Hamburg, about the role of research…
Liked by Adam Dziedzic
Experience
Education
Licenses & Certifications
Publications
-
CaPC Learning: Confidential and Private Collaborative Learning
ICLR 2021
See publicationMachine learning benefits from large training datasets, which may not always be possible to collect by any single entity, especially when using privacy-sensitive data. In many contexts, such as healthcare and finance, separate parties may wish to collaborate and learn from each other's data but are prevented from doing so due to privacy regulations. Some regulations prevent explicit sharing of data between parties by joining datasets in a central location (confidentiality). Others also limit…
Machine learning benefits from large training datasets, which may not always be possible to collect by any single entity, especially when using privacy-sensitive data. In many contexts, such as healthcare and finance, separate parties may wish to collaborate and learn from each other's data but are prevented from doing so due to privacy regulations. Some regulations prevent explicit sharing of data between parties by joining datasets in a central location (confidentiality). Others also limit implicit sharing of data, e.g., through model predictions (privacy). There is currently no method that enables machine learning in such a setting, where both confidentiality and privacy need to be preserved, to prevent both explicit and implicit sharing of data. Federated learning only provides confidentiality, not privacy, since gradients shared still contain private information. Differentially private learning assumes unreasonably large datasets. Furthermore, both of these learning paradigms produce a central model whose architecture was previously agreed upon by all parties rather than enabling collaborative learning where each party learns and improves their own local model. We introduce Confidential and Private Collaborative (CaPC) learning, the first method provably achieving both confidentiality and privacy in a collaborative setting. We leverage secure multi-party computation (MPC), homomorphic encryption (HE), and other techniques in combination with privately aggregated teacher models. We demonstrate how CaPC allows participants to collaborate without having to explicitly join their training sets or train a central model. Each party is able to improve the accuracy and fairness of their model, even in settings where each party has a model that performs well on their own dataset or when datasets are not IID and model architectures are heterogeneous across parties.
-
Pretrained Transformers Improve Out-of-Distribution Robustness
ACL 2020
See publicationAlthough pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by constructing a new robustness benchmark with realistic distribution shifts. We measure the generalization of previous models including bag-of-words models, ConvNets, and LSTMs, and we show that pretrained Transformers' performance declines are substantially smaller…
Although pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by constructing a new robustness benchmark with realistic distribution shifts. We measure the generalization of previous models including bag-of-words models, ConvNets, and LSTMs, and we show that pretrained Transformers' performance declines are substantially smaller. Pretrained transformers are also more effective at detecting anomalous or OOD examples, while many previous models are frequently worse than chance. We examine which factors affect robustness, finding that larger models are not necessarily more robust, distillation can be harmful, and more diverse pretraining data can enhance robustness. Finally, we show where future work can improve OOD robustness.
-
Band-limited Training and Inference for Convolutional Neural Networks
ICML 2019
See publicationThe convolutional layers are core building blocks of neural network architectures. In general, a convolutional filter applies to the entire frequency spectrum of the input data. We explore artificially constraining the frequency spectra of these filters and data, called band-limiting, during training. The frequency domain constraints apply to both the feed-forward and back-propagation steps. Experimentally, we observe that Convolutional Neural Networks (CNNs) are resilient to this compression…
The convolutional layers are core building blocks of neural network architectures. In general, a convolutional filter applies to the entire frequency spectrum of the input data. We explore artificially constraining the frequency spectra of these filters and data, called band-limiting, during training. The frequency domain constraints apply to both the feed-forward and back-propagation steps. Experimentally, we observe that Convolutional Neural Networks (CNNs) are resilient to this compression scheme and results suggest that CNNs learn to leverage lower-frequency components. In particular, we found: (1) band-limited training can effectively control the resource usage (GPU and memory); (2) models trained with band-limited layers retain high prediction accuracy; and (3) requires no modification to existing training algorithms or neural network architectures to use unlike other compression schemes.
-
Columnstore and B+ tree - Are Hybrid Physical Designs Important?
SIGMOD 2018
See publicationCommercial DBMSs, such as Microsoft SQL Server, cater to diverse workloads including transaction processing, decision support, and operational analytics. They also support variety in physical design structures such as B+ tree and columnstore. The benefits of B+tree for OLTP workloads and columnstore for decision support workloads are well-understood. However, the importance of hybrid physical designs, consisting of both columnstore and B+ tree indexes on the same database, is not well-studied -…
Commercial DBMSs, such as Microsoft SQL Server, cater to diverse workloads including transaction processing, decision support, and operational analytics. They also support variety in physical design structures such as B+ tree and columnstore. The benefits of B+tree for OLTP workloads and columnstore for decision support workloads are well-understood. However, the importance of hybrid physical designs, consisting of both columnstore and B+ tree indexes on the same database, is not well-studied - a focus of this paper. We first quantify the trade-offs using carefully-crafted micro-benchmarks. This micro-benchmarking indicates that hybrid physical designs can result in orders of magnitude better performance depending on the workload. For complex real-world applications, choosing an appropriate combination of columnstore and B+ tree indexes for a database workload is challenging. We extend the Database Engine Tuning Advisor for Microsoft SQL Server to recommend a suitable combination of B+ tree and columnstore indexes for a given workload. Through extensive experiments using industry-standard benchmarks and several real-world customer workloads, we quantify how a physical design tool capable of recommending hybrid physical designs can result in orders of magnitude better execution costs compared to approaches that rely either on columnstore-only or B+ tree-only designs
Courses
-
Advanced C++ Programming
-
-
Advanced Databases (Spatial, temporal, spatio-temporal and multimedia databases)
02288
-
Advanced Distributed Systems
CMSC 23310 01
-
Advanced Operating Systems
CMSC 33100 01
-
Algorithms
CMSC 37000 01
-
Analysis and Design of Information Systems
-
-
Analysis of Algorithms
-
-
Applied statistics and statistical software
02441
-
Computer Architecture
CMSC 32200 1
-
Convex Optimization
BUSF 36903 / STAT 31015
-
Data Bases 2
-
-
Data Mining
-
-
Discrete Mathematics
CMSC 37110 01
-
Introduction to Databases
CMSC 33550 01
-
Introduction to Mobile Applications Development
-
-
Introduction to Statistical Machine Learning
TTIC 31020 1
-
Java Programming
02115
-
Knowledge Discovery Methods (Machine learning)
-
-
Knowledge Engineering
-
-
Logical Systems and Logic Programming
02156
-
Mathematical Foundations of Machine Learning
CMSC 35300 2
-
Mathematical Toolkit
TTIC 31150-1 / CMSC 31150
-
Modern Heuristic Techniques
-
-
Oracle System Architecture and Data Base Administration
-
-
Python: Advanced Hands-On (32h)
-
-
SNN (Neural Nets)
-
-
Software Engineering
-
-
Speech Technologies
CMSC 35110 1
-
Team Building
-
-
Theoretical Basics of Cryptography
-
-
Topics in Databases
CMCS 33501
-
Web 2.0 and mobile interactions
02815
Projects
-
Soundpie - your music in restaurants & at parties
Well, a typical party. The current song terminates, yet parties hate silence, so guests try to play their cherished songs from the Internet. That forms a typical "YouTube queue" which we'd like to eschew by using our product. What's more, we'd like to provide you with your favourite music in every restaurant that you happen to enter.
Our fully functional application prototype was presented at Startup Sauna Warsaw and qualified for Warsaw final round. It integrated last.fm, OpenID…Well, a typical party. The current song terminates, yet parties hate silence, so guests try to play their cherished songs from the Internet. That forms a typical "YouTube queue" which we'd like to eschew by using our product. What's more, we'd like to provide you with your favourite music in every restaurant that you happen to enter.
Our fully functional application prototype was presented at Startup Sauna Warsaw and qualified for Warsaw final round. It integrated last.fm, OpenID, Youtube and Google Maps services through their API-s, was running on Android devices (clients) as well as on Google App Engine (server).Other creators -
Document management system
-
Document management system as an application in three-tiered architecture created in the framework of my bachelor of science dissertation.
-
People Placer: a Mobile Web 2.0 Application for DTU students
-
We designed and partially implemented a smartphone widget that simply finds people. It takes the advantage of a university campus and is fully integrated with the context - in this case with DTU campus. To facilitate collective use we had to devise external web service and achieved this by using Google App Engine. It incorporates declarative living and synchronized web concepts. Technology used: HTML, CSS, JavaScript, Google Maps API, iCalendar interface, Facebook API, Python, Google App Engine
Other creators -
Music Picker: a Mobile Web 2.0 Application for DTU students
-
We designed a smartphone widget which provides aspects of music social network interaction, helps a potential user mine information on listening habits, discover new artists, declare participation in upcoming concerts and even gives them an intelligent agent to adjust music to their daily activities. Technology used: HTML, CSS, JavaScript, Google Maps API, last.fm API.
Other creators
Honors & Awards
-
Scholarship for the best faculty students in 2011
Warsaw Univerity of Technology
Based on grades
-
Scholarship for the best faculty students in 2010
Warsaw Univeristy of Technology
Based on grades
-
Scholarship for the best faculty students in 2009
Warsaw University of Technology
Based on grades
-
Scholarship for the best faculty students in 2008
Warsaw University of Technology
Based on grades
Languages
-
English
Professional working proficiency
-
Polish
Native or bilingual proficiency
-
French
Elementary proficiency
More activity by Adam
-
CISPA and Technische Universität München (Technical University of Munich) are deepening their collaboration through joint initiatives to promote…
CISPA and Technische Universität München (Technical University of Munich) are deepening their collaboration through joint initiatives to promote…
Liked by Adam Dziedzic
-
September has begun, ushering in a period full of new opportunities for CISPA. I used the weeks of my vacation to reflect on the past years, take…
September has begun, ushering in a period full of new opportunities for CISPA. I used the weeks of my vacation to reflect on the past years, take…
Liked by Adam Dziedzic
-
Big Congratulations to our interns at CISPA! This is an amazing achievement and we’re looking forward to more stories from Beijing!
Big Congratulations to our interns at CISPA! This is an amazing achievement and we’re looking forward to more stories from Beijing!
Shared by Adam Dziedzic
-
Federal Chancellor visits CISPA Helmholtz-Zentrum für Informationssicherheit – a strong signal for digital sovereignty and trustworthy…
Federal Chancellor visits CISPA Helmholtz-Zentrum für Informationssicherheit – a strong signal for digital sovereignty and trustworthy…
Liked by Adam Dziedzic
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore More