PALiISaDS conference for
Professional Readiness, Inclusive Scholarship, and Mentorship
April 24-26, 2026 | UCI Campus
Fall 2026 applications are OPEN! Sign up for an Info Session today!
PALiISaDS conference for
Professional Readiness, Inclusive Scholarship, and Mentorship
April 24-26, 2026 | UCI Campus
We are thrilled to invite you to our second annual in-person all sites meeting! Mark your calendars for an inspiring and action-packed weekend!
Friday, April 24
Check in begins 3:00 PM
Welcome reception onsite
Saturday, April 25
Light breakfast
Welcome, PALiISaDS!
Keynote speech
Education panel
Coffee break
Workshop session
Lunch break
Workshop session
Coffee break
Career panel
Closing words
Group dinner
Sunday, May
Farewell breakfast @ hotel
Check-out by 11:00 AM
Keynote
Director of Research Computational and Data Science (Computational Research), Rady Children’s Health
Louis Ehwerhemuepha, PhD is the Director of the Research Computational and Data Science (Computational Research) team at Rady Children’s Health in Orange County (formerly CHOC). He leads a multidisciplinary team of data scientists and statisticians focused on advancing pediatric research through the application of statistics, machine learning, and artificial intelligence. Under his leadership, the team applies methods ranging from statistical learning on structured electronic medical record (EMR) data to deep learning approaches for computer vision and natural language processing in pediatric medicine, while also providing research data science support across the health system.
Dr. Ehwerhemuepha’s applied research spans a broad range of clinical and population health domains, including hospital readmission, sepsis, COVID-19, artificial intelligence for rare diseases, population health management, and the care of children with complex chronic conditions, particularly neurological and cardiovascular disorders. He has led the deployment of multiple statistical and machine learning models into the EMR, contributing to measurable improvements in quality of care. He collaborates closely with University of California, Irvine faculty in Pediatrics, Statistics, and Data Science to address clinically meaningful problems and advance pediatric health outcomes.
Workshops
Dr. Allison Theobold
Assistant Professor, Statistics Department
Cal Poly in San Luis Obispo
This introductory workshop will cover the following topics:
Cleaning data
Visualizing relationships
Choosing predictor (input) variables
Choosing between competing models
Interpreting model conclusions
Data ethics
The workshop will be offered in both R and Python, so scholars can choose which language they prefer. Ideally, participants will be able to work locally (on their computer) with a Quarto file (.qmd) or a Python notebook (.ipynb).
Dr. Wasila Dahdul
Data Curation Librarian, UCI Libraries
University of California, Irvine
Secondary data is data originally collected by others and available for reuse. Such data can support large-scale analyses, enable new research directions, and provide access to data that is difficult or impossible to collect firsthand. This workshop will provide an overview of data sources and data discovery platforms for a variety of disciplines. We will then consider key issues in reusing data by using R to explore and visualize a real-world dataset, the U.S. National Parks Visit Data. Working through this example, participants will assess data suitability and quality, and gain an understanding of how to use documentation, provenance, and the social and historical context to work responsibly with data.
Dr. Kelly Bodwin
Associate Professor, Statistics Department
Cal Poly in San Luis Obispo
In this workshop, we will deep dive into three major elements of the machine learning workflow: model specification, feature engineering, and model tuning. Using real world example data, we will assess which of these three has the biggest impact on a model's predictive success. We'll also discuss strategies for choosing a feasible number of pipeline variants to test, and how to iterate progressively in the model fitting process.
This material assumes the learner is familiar with the basic pipeline structure of Scikit-learn (python) or tidymodels (R); with common variable transformations like logarithms or interaction terms; and with common assessment metrics such as MSE, R-squared, MAE, accuracy, precision, and recall.
Dr. Wasila Dahdul
Data Curation Librarian, UCI Libraries
University of California, Irvine
Secondary data is data originally collected by others and available for reuse. Such data can support large-scale analyses, enable new research directions, and provide access to data that is difficult or impossible to collect firsthand. This workshop will provide an overview of data sources and data discovery platforms for a variety of disciplines. We will then consider key issues in reusing data by using R to explore and visualize a real-world dataset, the U.S. National Parks Visit Data. Working through this example, participants will assess data suitability and quality, and gain an understanding of how to use documentation, provenance, and the social and historical context to work responsibly with data.
Panels
Sarah is currently pursuing a Ph.D. in Statistics at the University of California, Irvine. Additionally, Sarah holds a BS in Statistics and Data Science, and a BA in Geography from the University of California, Santa Barbara. Her research focuses on developing statistical methodology for adaptive clinical trial designs, as well as applied research in Alzheimer’s disease and related dementias.
Senior Data Scientist and Applied Researcher
Brian Vegetabile is a Senior Data Scientist and Applied Researcher at LinkedIn. He is an expert in Causal Inference and Machine Learning and is currently working in the Product Services & Marketplace Data Science organization focused on Systems and Marketplace Understanding. Prior to his current team, he worked on problems related to forecasting, anomaly detection, and root cause analysis focused on applied operations problems. He earned his bachelor's in Aerospace Engineering at Penn State before his PhD and Master's in Statistics at UC Irvine.
Assistant Professor of Clinical Population and Public Health Sciences in the Division of Biostatistics
University of Southern California
Dr. Nuño received a B.S. in Applied Mathematics from the University of California, Riverside in 2015, then went on to complete graduate work at the University of California, Irvine, where she received an M.S in Statistics in 2017 and a Ph.D. in Statistics in 2020. Her research interests include clinical trials and the development of robust statistical methodology for efficient sampling designs. She is particularly interested in biomedical applications in Alzheimer’s disease and pediatric oncology.
she/her/hers
Manager - CX Operations
ConnectWise
After graduating with a B.S. in Statistics from CSU Monterey Bay in 2022, Sophia pursued a career as a data analyst in the technology sector. As an individual contributor at ConnectWise, her priorities were system implementation, churn prediction and analysis, and data pipeline management. She now leads a team of analysts focused on integrating technology and people, improving how internal teams interact with technological systems, processes, and automations.
PhD student, University of California, Santa Barbara
Cadee Pinkerton is a PhD student at UC Santa Barbara, working in statistical theory, methodology, and applied statistics in environmental and biomedical sciences.
MS student, Cal Poly in San Luis Obispo
Allen Choi is currently a Master's student at Cal Poly SLO studying Statistics. He grew up in Koreatown, Los Angeles and some of his hobbies include reading and finding new music to listen to. After graduation, Allen hopes to continue his education through a PhD in Statistics.
PhD student, University of California, Irvine
Sean Leader is a PhD student at UC Irvine studying Statistics. Sean loves using his domain knowledge and computational skills to better understand the world around him, in both a personal and professional context. Sean is currently working to formalize new statistical methods that combine existing efficiency frameworks with applications to biomarker discovery for Alzheimer's Disease.
MS student, Cal Poly in San Luis Obispo
Faran Igani is currently a PhD student at UC Irvine, working in survival analysis with applications to biomarker discovery for Alzheimer's disease. Faran enjoy his role as both a TA and tutor, working with students all the way from introductory statistics to those studying for their qualifying exams.