Lyra

The Lyra research project is a long-term effort to improve the understanding and reliability of data science software. It aims to develop practical and accessible analyses and tools for reasoning about, and providing rigorous guarantees on, the behavior of data analytics, big data, machine learning, and deep learning applications.

Lyra is an umbrella project including the following focused research projects:

Libra (focused on fairness-aware training and certification of machine learning models)
Sedano (focused on designing and developing static analyses for data science notebooks)

Completed Projects

Abhinandan Pal (Bachelor Student, IIIT Kalyani, India)
Abstract Interpretation-based Feature Ranking for SVMs
Research Internship (remote), 2022
Serge Durand
Static Analysis by Abstract Interpretation of the ACAS Xu Neural Networks
M1 Internship, École Normale Supérieure, 2020
Radwa Sherif Abdelbar
Automated Checking of Implicit Assumptions on Textual Data
Bachelor’s Thesis, ETH Zurich, SS 2018
Lowis Engel
Usage Analysis of Data Stored in Map Data Structures
Bachelor’s Thesis, ETH Zurich, SS 2018
Madelin Schumacher
Automated Generation of Data Quality Checks
Master’s Thesis, ETH Zurich, AS 2017
Mostafa Hassan
Static Type Inference for Python
Bachelor’s Thesis, ETH Zurich, SS 2017
Simon Wehrli
Static Program Analysis of Data Usage Properties
Master’s Thesis, ETH Zurich, SS 2017

Publications

Caterina Urban. Static Analysis for Data Scientists. In CSV, 2023.

PDF Project HAL Springer

Satoshi Munakata, Caterina Urban, Haruki Yokoyama, Koji Yamamoto, Kazuki Munakata. Verifying Attention Robustness of Deep Neural Networks against Semantic Perturbations. In NFM, 2023.

PDF Project HAL Springer

Satoshi Munakata, Caterina Urban, Haruki Yokoyama, Koji Yamamoto, Kazuki Munakata. Verifying Attention Robustness of Deep Neural Networks against Semantic Perturbations. In APSEC, 2022.

Project IEEE

Satoshi Munakata, Caterina Urban, Haruki Yokoyama, Koji Yamamoto, Kazuki Munakata. Verifying Attention Robustness of Deep Neural Networks against Semantic Perturbations. In arXiv/2207.05902, 2022.

PDF Project arXiv

Caterina Urban, Antoine Miné. A Review of Formal Methods applied to Machine Learning. In arXiv/2104.02466, 2021.

PDF Project arXiv HAL

Caterina Urban. What Programs Want: Automatic Inference of Input Data Specifications. In arXiv/2007.10688, 2020.

PDF Code Project BibTeX arXiv

Caterina Urban. Static Analysis of Data Science Software. In SAS, 2019.

PDF Project Slides HAL Springer

Mostafa Hassan, Caterina Urban, Marco Eilers, Peter Müller. MaxSMT-Based Type Inference for Python 3. In CAV, 2018.

PDF Code Project Artifact BibTeX Springer

Caterina Urban, Peter Müller. An Abstract Interpretation Framework for Input Data Usage. In ESOP, 2018.

PDF Code Project Slides BibTeX Springer

Talks

Interpretability-Aware Verification of Machine Learning Software

Thursday, February 9, 2023 2:00 PM

Séminaire IRILL, 🇫🇷 Center for Research and Innovation on Free Software, France

Project

Static Analysis for Data Scientists

Friday, July 8, 2022 1:30 PM

Isaac Newton Institute Workshop “Vistas in Verified Software”, 🇬🇧 Cambridge, UK (remote)

Video Project

Static Analysis for Data Scientists

Tuesday, June 14, 2022 1:30 PM

11th International Workshop on the State Of the Art in Program Analysis (SOAP 2022), 🇺🇸 San Diego, USA

Project

Static Analysis for Data Scientists

Friday, May 20, 2022 2:00 PM

Challenges of Software Verification Workshop, 🇮🇹 Università Ca’ Foscari Venezia, Italy

Slides Video Project

Formal Methods for Robust Artificial Intelligence: State of the Art

Wednesday, January 13, 2021

Lorentz Center Workshop “Robust Artificial Intelligence”, 🇳🇱 Lorentz Center, The Netherlands (remote)

Slides Video Project

Static Analysis for Data Science

Monday, November 2, 2020 10:00 AM

🇫🇷 INSERM, France (remote)

Slides Project

A Guided Tour of a Static Analyzer for Data Science Software

Monday, July 20, 2020 7:15 AM

2nd Workshop on Democratizing Software Verification (DSV 2020), 🇺🇸 Los Angeles, USA (remote)

Slides Video Code Project

Static Analysis of Data Science Software

Wednesday, October 9, 2019 2:00 PM

26th Static Analysis Symposium (SAS 2019), 🇵🇹 Porto, Portugal

Slides Video Project

What Programs Want: Automatic Inference of Input Data Specifications

Tuesday, April 2, 2019 11:30 AM

Guest Seminars, 🇮🇹 Gran Sasso Science Institute (GSSI), Italy

Project

An Abstract Interpretation Framework for Input Data Usage

Monday, October 2, 2017 5:00 PM

NII Shonan Meeting Seminar 100 “Analysis and Verification of Pointer Programs”, 🇯🇵 Shonan Village Center, Japan

Project

An Abstract Interpretation Framework for Input Data Usage

Tuesday, September 12, 2017 3:30 PM

NII Shonan Meeting Seminar 108 “Memory Abstraction, Emerging Techniques and Applications”, 🇯🇵 Shonan Village Center, Japan

Project