Anton Baumann

Anton Baumann

München, Bayern, Deutschland
406 Follower:innen 386 Kontakte

Info

I’m a Master’s student at TUM, conducting my thesis at ETH Zürich (Krause Lab) on…

Berufserfahrung

  • Aalto University Grafik

    Aalto University

    Stockholm, Stockholm County, Sweden

  • -

    Metropolregion Helsinki

  • -

    Munich, Bavaria, Germany

  • -

    München, Bayern, Deutschland

  • -

  • -

    München, Bayern

  • -

    München und Umgebung, Deutschland

Ausbildung

  • ETH Zürich Grafik

    ETH Zürich

    –Heute

    Supervised by Prof. Andreas Krause and Prof. Zeynep Akata; day-to-day supervision by Jonas Hübotter

  • Focused coursework in Robotics, Advanced Deep Learning, Reinforcement Learning, Applied GPU Programming; Swedish A1

  • -

    Thesis: “EGFX: Determining Optical Flow on Arbitrary Surfaces – An Extended Horn & Schunck Approach for Electrographic Flow Mapping” (Grade 1.0, supervised by Prof. Daniel Cremers)

  • -

    Early-admission program for gifted high-school students; completed courses in Databases and Linear Algebra

  • -

    Activities and Societies: bigband, orchestra, rowing team

Ehrenamt

  • StarCode e.V. Grafik

    Teaching, Website Development

    StarCode e.V.

    1 Jahr 6 Monate

    Wissenschaft und Technologie

    Starcode offers free programming courses for girls between the ages of 15 and 19 to introduce them to the many possibilities and opportunities in computer science and to spark interest in relevant university programs with an open and inspiring community.

    https://www.starcode.de

Veröffentlichungen

  • Post-hoc Probabilistic Vision-Language Models

    arXiv preprint

    Vision-language models (VLMs), such as CLIP and SigLIP, have found remarkable success in classification, retrieval, and generative tasks. For this, VLMs deterministically map images and text descriptions to a joint latent space in which their similarity is assessed using the cosine similarity. However, a deterministic mapping of inputs fails to capture uncertainties over concepts arising from domain shifts when used in downstream tasks. In this work, we propose post-hoc uncertainty estimation…

    Vision-language models (VLMs), such as CLIP and SigLIP, have found remarkable success in classification, retrieval, and generative tasks. For this, VLMs deterministically map images and text descriptions to a joint latent space in which their similarity is assessed using the cosine similarity. However, a deterministic mapping of inputs fails to capture uncertainties over concepts arising from domain shifts when used in downstream tasks. In this work, we propose post-hoc uncertainty estimation in VLMs that does not require additional training. Our method leverages a Bayesian posterior approximation over the last layers in VLMs and analytically quantifies uncertainties over cosine similarities. We demonstrate its effectiveness for uncertainty quantification and support set selection in active learning. Compared to baselines, we obtain improved and well-calibrated predictive uncertainties, interpretable uncertainty estimates, and sample-efficient active learning. Our results show promise for safety-critical applications of large-scale models.

    Andere Autor:innen
    Veröffentlichung anzeigen
  • Probabilistic MIMO U-Net: Efficient and Accurate Uncertainty Estimation for Pixel-wise Regression

    Arxiv

    Uncertainty estimation in machine learning is paramount for enhancing the reliability and interpretability of predictive models, especially in high-stakes real-world scenarios. Despite the availability of numerous methods, they often pose a trade-off between the quality of uncertainty estimation and computational efficiency. Addressing this challenge, we present an adaptation of the Multiple-Input Multiple-Output (MIMO) framework -- an approach exploiting the overparameterization of deep neural…

    Uncertainty estimation in machine learning is paramount for enhancing the reliability and interpretability of predictive models, especially in high-stakes real-world scenarios. Despite the availability of numerous methods, they often pose a trade-off between the quality of uncertainty estimation and computational efficiency. Addressing this challenge, we present an adaptation of the Multiple-Input Multiple-Output (MIMO) framework -- an approach exploiting the overparameterization of deep neural networks -- for pixel-wise regression tasks. Our MIMO variant expands the applicability of the approach from simple image classification to broader computer vision domains. For that purpose, we adapted the U-Net architecture to train multiple subnetworks within a single model, harnessing the overparameterization in deep neural networks. Additionally, we introduce a novel procedure for synchronizing subnetwork performance within the MIMO framework. Our comprehensive evaluations of the resulting MIMO U-Net on two orthogonal datasets demonstrate comparable accuracy to existing models, superior calibration on in-distribution data, robust out-of-distribution detection capabilities, and considerable improvements in parameter size and inference time.

    Andere Autor:innen
    Veröffentlichung anzeigen
  • Moore-Kurven

    Practical Course: Computer Architecture

Kurse

  • Analysis

    MA0902

  • Discrete Probability Theory

    IN0018

  • Introduction into Optimization

    MA2012

  • Linear Algebra

    MA0901

  • Mathematical Methods for Imaging and Visualization

    IN2124

  • Numerical Linear Algebra

    IN0019

  • Seminar - Deep Learning in Physics

    IN4939

Projekte

  • encore.

    –Heute

    Encore lets you share your Spotify party with friends and family. After joining a session, users can vote on exactly what songs get played and queued. Encore gives users the ability to link their Spotify accounts to tune in and stay synchronized with their current session. When a user is synchronized to a session, the session's playlist simultaneously plays on any of the user's Spotify devices, perfect for a virtual party! Alternatively, users can simply suggest and vote on songs, giving every…

    Encore lets you share your Spotify party with friends and family. After joining a session, users can vote on exactly what songs get played and queued. Encore gives users the ability to link their Spotify accounts to tune in and stay synchronized with their current session. When a user is synchronized to a session, the session's playlist simultaneously plays on any of the user's Spotify devices, perfect for a virtual party! Alternatively, users can simply suggest and vote on songs, giving every party guest a say in what plays on!

    Andere Mitarbeiter:innen
    Projekt anzeigen
  • Unsere Planie

    Building an app focusing on interactive games designed to familiarize kids with garbage separation. Users are able to compete for prizes individually and in teams of buildings and neighborhoods. The app also provides local information on waste separation as well as general information on its environmental impact.

    Stack: React + Typescript / Golang / MongoDB

    Andere Mitarbeiter:innen
    Projekt anzeigen

Sprachen

  • Deutsch

    Muttersprache oder zweisprachig

  • Latein

    Grundkenntnisse

  • Altgriechisch

    Grundkenntnisse

  • Englisch

    Verhandlungssicher

Erhaltene Empfehlungen

Anton Baumanns vollständiges Profil ansehen

  • Herausfinden, welche gemeinsamen Kontakte Sie haben
  • Sich vorstellen lassen
  • Anton Baumann direkt kontaktieren
Mitglied werden. um das vollständige Profil zu sehen

Weitere ähnliche Profile

Weitere Mitglieder namens Anton Baumann in Deutschland