Catégorie profils: Machine learning and optimization

L’Ecole normale supérieure - PSL / FAIR Paris

stephane.dascoli [at] gmail.com

Short bio

Master in Theoretical Physics, ENS Paris

Thesis title

Deep learning: from toy models to modern architectures.

Short abstract

My research focuses on understanding how deep neural networks are able to generalize despite being heavily overparametrized. On one hand, I use tools from statistical mechanics to study simple models, and try to understand when and why they overfit. On the other hand, I investigate how different types of inductive biases affect learning, from fully-connected networks to convolutional networks to transformers.

YAMANE Ikko

Postdoctoral researcher

Dauphine - PSL

ikko.yamane [at] dauphine.psl.eu

Short bio

Ph.D. from The University of Tokyo

Research topic

Counter factual inference with weakly supervised learning

Short abstract

In counter factual inference, one tries to predict what would happen if attributes of data were some values different from that actually observed. Existing counter factual inference methods often require expensive, controlled experiments to be conducted to collect necessary data. My research interest focuses on developing methods that only need cheaper and efficient experiments possibly with missing observations or milder conditions.

ROMAIN Manon

PhD student

L’Ecole normale supérieure - PSL

manon.romain [at] inria.fr

Short bio

Diplôme de l’Ecole polytechnique
MSc of Computational and Mathematical Engineering – Stanford University

Thesis title

Study of causal networks.

Short abstract

Causal inference is very important to a wide range of use from clinical trials to econometrics: we learned that “correlation is not causation” but how can we learn true causal relationships? We will using learning of causal diagrams using the latest advances in optimization. We will also study experimental design, given your current knowledge, how to best use your limited resources to gain insightful causal information (e.g., by doing biological experimentations)?

MERAD Ibrahim

PhD Student

Université de Paris

ibrahim.merad [at] etu.u-paris.fr

Short bio

Master 2 MVA , ENS Paris-Saclay

Thesis title

Apprentissage non-supervisé de représentations et applications en santé (Unsupervised representation learning and applications in healthcare).

Short abstract

Unsupervised representation learning has recently caught up with the performance of supervised approaches thanks to the introduction of contrastive methods. The mathematical study of these new methods is essential to better understand and exploit them as well as provide guarantees on their performance. Their application is especially relevant in healthcare where supervision is commonly lacking.

DOUIN Adèle

PhD Student

L’Ecole normale supérieure - PSL

adele.douin [at] phys.ens.fr

Short bio

Master Degree “Physics of Complex Systems” at Université de Paris

Thesis title

Knit-quakes Prediction through Machine Learning.

Short abstract

Knitted fabric is a thread-based metamaterial that exhibits crackling noise both in the force signal and in the deformation field when deformed. The occurrence of those intermittent, scale-invariant events cannot be accurately predicted at the time being, as in most systems with similar avalanche-like behavior. However, Machine Learning methods have proven to be a useful tool when dealing with time-series predictions and image processing, giving hope to be able to predict relevant knit-quakes quantities.

BRION BOUVIER Florie

PhD Student

Université de Paris

f.brionbouvier [at] gmail.com

Short bio

Master of Science in Public Health, Comparative Effectiveness Research (University of Paris)

Thesis title

Estimation of the Individual Treatment Effect via Machine Learning using an Individual Participant Meta-Analysis.

Short abstract

Personalized medicine aims at tailoring a treatment to the individual characteristics of each patient. One key aspect of personalized medicine is to identify the subgroups of patients who benefit from an intervention, which we can do by estimating the Individual Treatment Effect (ITE). My goal is to develop Machine Learning methods to estimate the ITE using Individual Participant Meta-Analyses with binary and time-to-event outcomes. I will develop these methods on RCTs and observational data. I will also use federated learning to estimate the ITE.

ARJOVSKY Martin

Postdoctoral researcher

INRIA

martinarjovsky [at] gmail.com

Google scholar

Short bio

PhD, New York University

Research project

Out of Distribution Generalization in Machine Learning.

Short abstract

Machine Learning suffers from a fundamental problem: when systems are deployed in situations different than they were trained on, they can fail catastrophically. I aim to improve a large family of these problems by leveraging ideas from causality, deep learning, and the recognition of invariant statistical patterns. I am particularly focused in high-dimensional nonlinear problems (such as vision, language, and speech).

GANASSALI Luca

PhD Student

INRIA / l’Ecole normale supérieure - PSL

luca.ganassali [at] inria.fr

Short bio

Graduated from Ecole Polytechnique in 2019

Master’s degree at Université Paris-Sud

Research domain

Machine Learning and Optimization

Thesis title

Statistical inference on graphs: the graph alignment problem.

Short abstract

We study inference problems in graphs and matrices, such as graph alignment, which aims at finding a matching between nodes of two graphs preserving most of the edges. In a Bayesian setting, several approaches may be followed: analyzing rigorously existing algorithms based on random models to determine the regimes in which they may succeed, look at the information-theoretical and computational thresholds, or design and propose new algorithms that explore new regimes.

CHADEBEC Clément

PhD student

INRIA

clement.chadebec [at] sorbonne-universite.fr

Short bio

Master 2 – Mathématiques, Vision & Apprentissage – ENS Paris-Saclay
Ingénieur Civil des Mines de Paris – Mines ParisTech

Thesis title

Exploration of the latent space modelling of Variational Autoencoders.

Short abstract

In the medical field, the lack of data resulting in low patient variability remains a key issue. For example, in neuroscience, practitioners have to deal with potentially very high-dimensional data combined with a very small number of samples. Generative models such as variational autoencoders (VAE) may reveal particularly well suited to perform dimension reduction on such data and their generative capacity could be used for data augmentation. Unfortunately, VAEs perform weakly when trained on a (very) small number of data and the underlying structure of the latent space remains poorly understood. We believe that further investigating the latent space geometry of these generative models could allow us to overcome these issues.

YGER Florian

florian.yger [at] dauphine.fr

Short bio

Florian Yger is an associate professor of computer science at Université Paris Dauphine-PSL and a researcher in the MILES team at LAMSADE since 2015. From 2014 to 2015, he was a JSPS postdoctoral fellow in the laboratory of Pr. Sugiyama in Tokyo University. He received his PhD in Computer science from LITIS, Université de Rouen under the supervision of Alain Rakotomamonjy in 2013. He is a visiting researcher at RIKEN AIP, Japan since 2017.

Topics of interest

Trustworthy machine learning, Causal inference, interpretable AI

Project in Prairie

Florian Yger will address the questions of trust, explainability and interpretability in machine learning models (including deep learning) with a focus on the robustness to adversarial examples and counterfactual reasoning on data. This project has natural and practical applications in the medical field.

Quote

In the last decade, deep learning has made possible breakthrouhgts in several domains (e.g. computer vision, machine translation, games, …). Yet those hardly interpretable algorithms are fed with huge amounts of -sometimes sensitive- data and can suffer from malicious attacks: attacks on the privacy of the data and attacks on the robustness where adversarial examples are generated to fool the algorithm. This is a critical issue (especially in medical applications) and we feel that an effort toward a deeper theoretical analysis is needed.

Team

YAMANE Ikko

Postdoctoral researcher

Postdoc

RUDI Alessandro

alessandro.rudi [at] inria.fr

Short bio

Alessandro Rudi is Researcher at INRIA, Paris from 2017. He received his PhD in 2014 from the University of Genova, after being a visiting student at the Center for Biological and Computational Learning at Massachusetts Institute of Technology. Between 2014 and 2017 he has been a postdoctoral fellow at Laboratory of Computational and Statistical Learning at Italian Institute of Technology and University of Genova.

Topics of interest

Large scale machine learning, structured prediction

Project in Prairie

My main research interest is in machine learning. In particular my focus is on theoretical and algorithmic aspect of statistical machine learning, with the goal of devising algorithms that at the same time can (a) scale on big data (b) be easily applied in practice (c) have strong theoretical guarantees in terms of statistical and computational aspects (d) achieve state of the art error on the prediction, with reduced computational costs.

Quote

Large Scale ML with statistical guarantees. ML algorithms can be divided in (a) non-parametric, with strong theoretical guarantees, but high computational requirements especially in terms of memory footprint, as kernel methods (b) parametric, as deep nets, with small computational complexity and effective results in practice, but without theoretical guarantees. The goal of my work is to develop hybrid methods that take the best of both worlds: fast, effective and with theoretical guarantees. Structured Prediction. Nowadays data are very often more complex than vectors. In many fields learning problems consist in predicting structured/complex objects from other structured objects. Using the power of infinite-dimensional implicit embeddings, my goal in this direction consists in developing a unified theoretical and algorithmic framework able to deal effectively with a wide family of structured inputs and outputs.

Team

BEUGNOT Gaspard

PhD student

PhD student

MALLAT Stéphane

Data Sciences

stephane.mallat [at] ens.fr

Short bio

Professor at NYU from 1988 to 1994. Professor at Ecole Polytechnique, from 1994 to 2012. Co-founder and CEO of a semiconductor start-up from 2001 to 2007. Professor in Computer Science at Ecole Normale Supérieure from 2012-to 2017. Professor at the Collège de France in Data Sciences since 2017. Member of the French Academy of sciences, of the French Academy of Technologies and foreign member of the US National Academy of Engineering. IEEE and EUSIPCO Fellow. Recipient of the SPIE 2007 Outstanding Achievement Award, of the 2004 European IST Grand prize, of the 2004 INIST-CNRS prize for most cited French Researcher in engineering, of the 2015 IEEE Signal Processing best sustaining paper award, of the 2017 IEEE Freidrich Gauss Prize.

Topics of interest

Harmonic analysis, machine learning, signal processing

Project in Prairie

Stéphane Mallat will be working on the mathematical understanding and interpretability of deep neural networks with applications to images, audio, financial data, quantum chemistry and cosmology. He teaches Data Sciences and will promote the interface between industry, academia and students through the organization of data challenges in http://challengedata.ens.fr

Quote

Deep neural networks have considerable applications to services, industry, and science but remain a black box whose properties are not well understood. Robustness and interpretability of deep neural networks become a major issue for their applications. Understanding deep networks involves many branches of mathematics, including statistics, harmonic analysis, geometry, and optimization in high dimension, together with algorithmic experiments on real data. Working on very different types of data and applications, gives an access to generic mathematical and algorithmic properties of these networks. Simplifying network architectures, while preserving performance is an important direction of this research.

Team

MOREL Rudy

PhD Student

PhD student

D’ASPREMONT Alexandre

Optimisation

aspremon [at] ens.fr

Short bio

After dual PhDs from Ecole Polytechnique and Stanford University in optimisation and finance, followed by a postdoc at U.C. Berkeley, Alexandre d’Aspremont joined the faculty at Princeton University as an assistant then associate professor. He returned to Europe in 2011 and is now a research director at CNRS, attached to Ecole Normale Supérieure in Paris. He received the SIAM Optimization prize, a NSF CAREER award, and an ERC starting grant. He co-founded and is scientific director of the MASH Msc degree at PSL. He also co-founded Kayrros SAS, which focuses on energy markets and earth observation.

Topics of interest

Optimisation, machine learning

Project in Prairie

Alexandre d’Aspremont’s work is focused on optimization and applications in machine learning, statistics, bioinformatics, signal processing and finance. He collaborates with several companies on projects linked to earth observation, insurance pricing, statistical arbitrage, etc. He is also co scientific director of MASH, a Msc program focused on machine learning and its applications in digital marketing, journalism, public policy, etc.

Quote

Optimization plays a central role in modern statistics and machine learning
in particular. Beyond direct applications of optimization algorithms, convex duality results and complexity theory underpin many recent developments in statistical learning (e.g. compressed sensing or matrix completion problems). Yet the link and clear empirical tradeoff between statistical performance and computational complexity has yet to be fully explained, which are crucial in most domains, in particular safety/health critical ones.

Team

MISCHENKO Konstantin

Postdoctoral researcher

Postdoc

ROMAIN Manon

PhD student

PhD student

DUBOIS-TAINE Benjamin

PhD student

PhD student

BACRY Emmanuel

bacry [at] ceremade.dauphine.fr

Short bio

Senior Researcher CNRS (2016-), Chief Scientific Officer of the Health Data Hub (2019-), Head of Health/Data projects, Ecole Polytechnique (2019- ), Head of Data Science & Big Data Initiative and Associate professor at Ecole Polytechnique (2014-2019).

Topics of interest

Machine learning, Point processes, Big data, Health

Project in Prairie

Emmanuel Bacry will work on AI algorithms applied to longitudinal data with a focus on healthcare data of the SNDS database. He will work on various applications such as detecting weak signals in pharmacoepidemiology or optimization of health pathways of a given pathology, developed in close collaboration with CNAM and the Health Data Hub.

Quote

The SNDS database is a unique medico-administrative health database. It is one of the largest in the world (>65 million people, >200Tb). Though it contains no clinical data, it is very rich, and can contribute to major applications with strong potential impacts on health and/or on economy of health. Moreover, thanks to the Health Data Hub, this database will be soon enriched by large amount of clinical databases. Working on these very large databases is complex but extremely exciting.

Team

SUSMANN Herbert

Postdoctoral researcher

PhD, Biostatistics, University of Massachusetts Amherst

BOUDABOUS Safa

Postdoctoral researcher

PhD, Computing, Data and AI, Institut Polytechnique de Paris

ZARHALI Othmane

PhD student

Ecole des Mines de Nancy, Engineering degree, major: Applied mathematics Ecole Polytechnique / Sorbonne Université, Msc Probabilités et finance (ex DEA El KAROUI)

BACH Francis

Machine learning

francis.bach [at] inria.fr

Short bio

Researcher at Inria, leading since 2011 the machine learning team which is part of the Computer Science department at Ecole Normale Supérieure. Ph.D. Berkeley (2005). ERC Starting grant (2009) and Consolidator Grant (2016), Inria young researcher prize (2012), ICML test-of-time award (2014), Lagrange prize in continuous optimization (2018). Co-editor-in-chief of the Journal of Machine Learning Research. Member of the Academy of Sciences.

Topics of interest

Machine learning, Optimization, Statistics

Project in Prairie

Francis Bach will address fundamental problems in machine learning, using optimization methods that come with theoretical guarantees and can help solve challenging problems in computer vision and natural language processing. He will participate in the teaching effort through his existing classes and in collaborations with PRAIRIE industry partners.

Quote

Machine learning algorithms are ubiquitous in most scientific, industrial and personal domains, with many successful applications. As a scientific field, machine learning has always been characterized by the constant exchanges between theory and practice, with a stream of algorithms that exhibit both good empirical performance on real-world problems and some form of theoretical guarantees. Many of the recent and well publicized applications come from deep neural networks, where these exchanges are harder to make. Bridging this new gap between theory and practice is key to obtaining performance guarantees and uncertainty estimates, which are crucial in most domains, in particular safety/
health critical ones.

Team

Ivan LERNER

PhD student

Diplôme d’etude supérieur de Santé Publique, Université Paris Descartes, 2020 Double-cursus Médecine-sciences, 2016, Université Paris Descartes Master 2 de science-cognitives, 2013, École normale supérieure, Paris

MARTIN Simon

PhD student