My Projects
1. Student Statistical Consultant (Capstone Project)
- With guidance from Dr. Rodolfo Lourenzutti, I worked for a client analyzing changes in glaucoma eye surgery in an apprentice type of job.
- Real client work, regular meetings with the professor presenting work, poster board and oral presentations
- Produced reports, code, and worked directly for the client using the R programming language and GitHub
- Learned about complications with study design, model selection, and effective statistical communication
2. The Daily Grind (CMD-F Hackathon)
- A 24 hour Computer Science Hackathon on the weekend of March 8-9th, 2025
- Winners of the “Social Track” for our coffee-chat conversation starter using AI
- Won 3rd place overall, with 50+ projects and 200+ participants
- Presented work to 7 judges individually, and in the final round presented in front of the entire Hackathon community (over 250+ people).
- For more info, see our Devpost.
3. DSCI 310: Reproducible and Trustworthy Workflows
An analysis and package that is a fully reproducible and robust data analysis project, comprised of:
- A well documented and modularized software package and scripts written in R,
- A data analysis pipeline automated with GNU Make and Makefile’s,
- A reproducible report powered by R Markdown and Quarto,
- A containerized computational environment created and made shareable by Docker, and
- A remote version control repository on GitHub for project collaboration and sharing,
- As well as automation of test suite execution and documentation and software deployment.
For my group’s work, see the R package repository here and the analysis repository here.
4. Weather Extremes Research
- Working at UBC from May to August 2024 under an URSA NSERC scholarship
- Finding patterns and explanations for climate extremes. Dealt with Canadian weather data, relating to floods, droughts, and wildfires using Python and R languages. Completed EDA and used various statistical analyses to gain insight into weather patterns and time series analysis.
- Analyzed many research papers, presented work weekly, and assisted both professor Harry Joe and professor Natalia Nolde with various tasks.
- Work will be published on a public repository soon!
5. ASA DataFest
- A 48 hour Hackathon in a team of 3 for the American Statistical Association.
- Created a fully reproducible project, with EDA and a complex statistical analysis
- Created a dashboard, demonstration video, presentation using quarto, and analysis using a Bayesian model
- See the repository: https://github.com/hlan22/asa_datafest_2025
6. wordplayr
- An R Package
- The aim of
wordplay
is to learn more about your favorite list of words! Learn about metrics such as length, most frequent starting letter, and much more with ease! - A fully functioning R package with clear
README.Rmd
instructions on how to install and use, along with a vignette, website, and Roxygen comments to help guide the user. - I have built up a ‘list of favorite words’ over the years and wanting to learn more about them is what inspired me to make this package!
- See the R package repository and the package website.
7. Python Dashboard & Data Visualization
DSCI 320 Dashboard
With two group members, we created the following dashboard where we had to create inventive visualizations to explain 4 questions. We looked into trends for attendance at libraries in the San Francisco area. The dashboard is interactive and has toggle options that don’t show up in the saved image.
8. easyEDA
- A Python Package (In development)
- A Python Package (still in progress) such that you can “automate” EDA for any given dataset.
- Functions will quickly and easily create proper plots based on data types, for any and all columns.
- Code will be done with Python so that you can include interactive visuals, but will expand to R too.
- Link to the repository here