Hanna ZENG
Hanna ZENG

Hanna ZENG

( MS @ Harvard, BS@UofT)
Navigation Bar
Portfolio:
notion image
Video preview
  • Released Python package
notion image
  • Sleep State Detection Modeling
  • Gratitude to Strangers App (Groupwork)
  • Bike Thefts Viz
  • Education Study Web
“The life you live will expand or shrink in proportion to the courage you display.”
 
My preferred first name is Hanna, and I graduated from the Data Science Masters program at Harvard. Prior to attending Harvard HDS program with full scholarship, I completed my undergraduate studies at the Faculty of Arts and Science, University of Toronto St. George. During my time there, I pursued a double major in Computer Science & Statistical Science, with a minor in Mathematics. I graduated with an honors bachelor's degree from three departments, receiving high distinction. Before joining the University of Toronto, I underwent over 1.5 years of non-degree training at Beijing Normal University-Hong Kong Baptist University. I am immensely grateful for the guidance and support from my mentors and advisors who have played a crucial role in helping me achieve my goals!
My passion lies in exploring data-driven interdisciplinary topics, and I have previously collaborated on projects related to AI and machine learning with applications in healthcare, education, finance, and social economics. Currently, I am actively seeking for DataOps/MLOps Engineer opportunities. Having dedicated a significant amount of time to research, I'm truly enthusiastic about acquiring hands-on experience and discovering my capabilities within industry roles.
I have a wide range of interests and unlimited potential. I believe in continuous learning and growing with the advancements of the era.
If you have any open positions still in hiring, reach out to me right now! I am happy to know more about the qualifications of applicants that you are looking for and discuss any chances to work. I am eager to contribute my skills and knowledge to meaningful projects in the field. 🙂
 

📚 Education

Master of Science in Data Science , Harvard University
Information Science with applications in Healthcare/Financial Insurance/Tech etc.
During my time at Harvard, I am fully funded by admission scholarships and serve as the Academic Program Representative and International Students Advocate in the Harvard Chan Student Association. I have also been selected as a committee member for advising groups affiliated with the Career and Development Office.
  • Cross registration EECS, MIT
All Graduate Level
DataOps plus domain knowledge eCommerce/Health … etc (Click to see more)
CS
Stat
Eco & Finance & other social science
Public Health
Harvard Medical School
MLOps
Statistical Inference(waived)
Biology & Social Networks
Epidemiology
Visualization
System Development
Independent Study
Public Health Foundations
DS and ML
High Performance Computing
Computer Networks
Data Science II
Data Science I
Computing for Big Data(Spark, parallel computing)
Javascript d3 - Front end + Visualization
MIT NLP
 
Grade

Honour’s BSc. in Computer Science ( 🎓 ) University of Toronto
double majoring in Statistics, minor in Mathematics&Finance
📑 Selected Relevant Coursework
*Graduate Level
CS STAT MATH plus domain knowledge ECO/FIN/BIO/CGS/PHL … etc (Click to see more)
CS
Stat
Math
Eco & Finance & other social science
Quant Business
other Sciences
Computer organization
Probability & Statistics I&II
Linear Algebra I&II
Mirco economics
Accounting
Nutrition Science
*Computer graphics
*Advanced probability (pending)
Multivariate Calculus I&II (several variables&vector cal)
Macro economics
Business/marketing intro
Philosophy, Science ethics
AI & ML (Oxford)
*Time series analysis (pending)
Ordinary differential equation
Political science quant empirical research
Business data analytics
*Neural networks and deep learning
*Statistical machine learning I&II
Combinatorics
Financial planning and investment analysis
Theory of Computation
*Methods of data analytics
Elements of Analysis(real+complex) (pending)
Data structure and algorithms
*Categorical data analysis
Complex Analysis
Cognitive science of AI (Yale)
Regression analysis
Symbol logics
Data visualization
*Image understanding (pending)
*Bayesian statistics
Computational media (HCI)
Database systems
Object-oriented programming
Structured programming
Discrete mathematics
Software design
 
UofT Coursework page
grade

💼 Work

Compass Group - Compass Digital Labs | AWS/Snowflake/Airflow/Spark/EMR/EKS/GCP | Data Engineer 2024 May - Present
a Fortune Global 500 company, the world's largest B2B food and beverage service company, multinationally employing over 500,000 people.
  • listed on the London Stock Exchange
Tickets:
cleantelligent - simple ingestion
definitive healthcare - complex ingestion
remedy - (MS Sql Server) redshift2snowflake migration
toast - api ingestion - flatten complex JSON from various payloads with hierarchical data into different fact and dimension tables
confidential employee data (PGP encrypted) from SFTP - complex ingestion
insurance claims
UHN Toronto General Hospital - E-Health Centre for Digital Therapies | LLM | Data Engineering | Research Volunteer 2024
CDTx EHealth Innovation; Contributing to clinical notes(pdf) understanding and voice agent development
Independent Study+Capstone for credits
Apple  | Database/Data Crawling/Large Language Model(LLM) | AI/ML intern  2024 (Offer Declined Due to personal plan& interests change, also pay reasons. Passed the background check process which was required for this unconditional offer)
notion image
notion image
notion image
Harvard Widener Library Digital Reserves Employee | Adobe/LaTex/Digitizing | Support Digital Accessibility  2024 April - Aug (part-time)
➢ Digital Reserves Team - Harvard Widener Library. Manage digitizing and scanning of materials in support of teaching and learning at Harvard University each semester. Create accessible pdfs, track digitizing requests, process requests, maintain the Digital Reserves collection to better service Harvard community. Re-type complex math equations, transcribe intricate mathematical equations into digital formats, ensuring their accuracy and readability for academic use
Rotman School of Management | Python/AWS/SQL/ETL | Data Role Work Study Paid  2022 Sep -2023 March
➢ Used AWS to help data pipelines and data processing workflows for large TBs retail financial data, in support of car buyers’ behavior analysis. Collaborated on ETL tasks, as well as the creation of data visualization tools. Worked on statistical modeling to uncover insights into car buyers’ behaviors based on demographic factors, such as gender and car configuration preferences (private repo)
Hongzon Information Technology Co., Ltd | SPSS/R/Python/Axure | IT Product Intern 2020 Summer (Know myself not a great fit for product role in the very beginning haha, but anyway a good trial! Later I switched my interests to Data, AI Engineer )
➢ Designed product strategies based on analysis of market demand, communicated with the technical department and commercial companies, presented visual charts and series of Axure-based prototype diagrams and product research documents

⌨ Projects

⌨ Projects Details

🎮 Games For Fun

Goldminer Demo, Rotating Rose Demo | MATLAB
Breakout (Assembly) Report DemoComingSoon

🏫 Professional Activities

  • Publications:
Journal:
Qi Xu. Xiaoke Cao. Geping Chen. Hanqi Zeng. Haoda Fu. Annie Qu. "Multi-label residual weighted learning for individualized combination treatment rule." Electron. J. Statist. 18 (1) 1517 - 1548, 2024. https://doi.org/10.1214/24-EJS2236
  • Conference Reviewer (3+ years):
International Conference on Artificial Intelligence and Statistics, 2023👇
(AISTATS,2023)
International Conference on Artificial Intelligence and Statistics, 2024👇
(AISTATS,2024)
International Conference on Artificial Intelligence and Statistics, 2025👇
(AISTATS, 2025)

💻 Prior Research Journey

When it comes to academic fields, my most recent AI practice lies in information retrieval, data mining, recommendation. ( prior experience with applications in health-tech/edu). I cross registered NLP course in 2023 Fall at MIT, a great course taught by Prof. Yoon Kim.
Deep Learning in Optimal Individualized Omni-channel Treatment Decision Rule from a Multi-label Classification Perspective Code
| Python |  QuLab (Research Exchange) Individual Study July 2022 - Present
➢ Designed a Multi-Label Residual Weighted Learning (MLRWL) framework with a novel ITR estimation method for combination treatments incorporating interaction effects among treatments with applications in precision medicine
➢ Co-authored paper the Electronic Journal of Statistics (EJS) Vol. 18 (2024) 1517–1548
[PDF] Multi-Label Residual Weighted Learning for Individualized Combination Treatment Rule | Semantic Scholar
This paper introduces a novel ITR estimation method for combination treatments incorporating interaction effects among treatments, proposing the generalized $\psi$-loss as a non-convex surrogate in the residual weighted learning framework, offering desirable statistical and computational properties. Individualized treatment rules (ITRs) have been widely applied in many fields such as precision medicine and personalized marketing. Beyond the extensive studies on ITR for binary or multiple treatments, there is considerable interest in applying combination treatments. This paper introduces a novel ITR estimation method for combination treatments incorporating interaction effects among treatments. Specifically, we propose the generalized $\psi$-loss as a non-convex surrogate in the residual weighted learning framework, offering desirable statistical and computational properties. Statistically, the minimizer of the proposed surrogate loss is Fisher-consistent with the optimal decision rules, incorporating interaction effects at any intensity level - a significant improvement over existing methods. Computationally, the proposed method applies the difference-of-convex algorithm for efficient computation. Through simulation studies and real-world data applications, we demonstrate the superior performance of the proposed method in recommending combination treatments.
[PDF] Multi-Label Residual Weighted Learning for Individualized Combination Treatment Rule | Semantic Scholar
➢ Aimed at figuring out an optimal Individualized Omni-channel Treatment Rule in precision medicine situations
Deep learning-based Recommender Systems Code
| TensorFlow/PyTorch | Data Intelligence Lab Research Assistant May 2022 Supervisor: Chao Huang, Department of Computer Science, Hong Kong University Data Intelligence Lab ➢ Re-implemented Neural Graph Collaborative Filtering Model and Hypergraph Contrastive Collaborative Filtering ➢ Used in recommended scenarios where collaboration signals hidden in the user-item interaction cannot be ignored and need comprehensive capture of complex higher-order dependencies among users in the embedding process
Interplay of factors associated with risk in the pre-disease stage of Crohn's disease Code
| Python | Data Sciences Institute of University of Toronto| Research Scholar March 2022 Supervisor: Kenneth Croitoru, Clinician-Scientist and Professor, Luenefeld Tannenbaum Research Institute, Mount Sinai ➢ Implemented Logistic Regression and Casual Inference tools to predict the risks of Crohn’s Disease via Bile Acid profiles ➢ Helped clinical disease screenings and risk detection in patients’ early stage
AI-Based Language Chatbot2.0 Code
| Python/SQL/WXML | contribution acknowledged in publication Emerging Technologies for Education Dec 2020 – Jul 2021 Supervisor: Raymond Lee, Hong Kong Quantum Finance Forecast Center ➢ Designed a database (Mongo DB for Tencent Cloud) and used AI technologies automatic options generation and speed recognition for the development of an English Language Concept Learning Agent App ➢ Combined English concept learning, assisted and conducted English learning by providing various exercises such as multiple-choice and phonetic questions( WordNet, Word2vec, BERT)

🏆 Honours and Awards

  • First Class Scholarship 2020
  • President’s Honour Roll for 2 academic years
  • Dean’s List Scholar for 2 academic years
  • Harvard Central Grant & Stipend (full tuition & stipend admission) = US $105,000+
1st year Master:
  • Victor and William Fung Fellowship ~$25,000
  • Dr. Theodore Montgomery Scholarship Fund ~$19,974
  • Load Relief Award ~$6,534
  • Harvard Stipend ~$12,200
  • Harvard Central Grant ~$6,292
2nd year Master:
  • Dr. Theodore Montgomery Scholarship Fund ~$19,974
  • Harvard Central Grant ~$9,971
  • Harvard Stipend ~$5,055
 

Arts🎨 Music🎸 & STEM🤖️

Integrating Arts, Music, and STEM
💡
Mentorship DCS Women Mentorship@Utoronto
💡
Elevate Festival 2022
💡
🎼 Music Time & My 📷 Photography Gallery

📝 Other

2020 Yale Summer Session (Cognitive Neuroscience, Ethics of AI) Dr. Joanna Lawson, Department of Philosophy
2021 Mathematical/Interdisciplinary Contest in Modelling
2021 Spring Huawei Kunpeng Training Program
2021 Summer Huawei Seeds for The Future in Canada
2022 Oxford Study Abroad (AI and Machine Learning) Dr. Rob Collins and Dr. Nigel Mehdi, CS Department Individual Assignment Project: astrophysics research Code
UofT Computer Science Student Union (CSSU) member; Women in Computer Science community(WiCS) ; 3M Running Club member
Canadian Open Mathematics Challenge (COMC) grader
Leadership Portfolio (partial, mostly freshman year)
Cocurricular Transcript download by clicking here

Contact ☎️ Me
Check my LinkedIn 👔  / GitHub ⌨️  by clicking the floating icons on this web
Longwood Medical Area, MA Boston, 02115, USA / Harvard Square, MA Cambridge, 02138, USA
 
* This web framework version was built back in 2022.

 
陈攀bb我爱你😘