Wonjune Lee

AI Engineer & Data Scientist

Wonjune Lee

Building meaningful systems
with data and AI

I study Computational & Data Sciences at
George Mason University.
I currently conduct research at the Ulsan National Institute of Science and Technology (UNIST)
Artificial Intelligence Graduate School.

Python Machine Learning Deep Learning AI R SQL RAG Data Visualization FastAPI

Education

  • George Mason University B.S. Computational & Data Sciences Aug 2023 — May 2026
  • Ghent University Bioscience Engineering Feb 2019 — Aug 2021

Experience

  • Ulsan National Institute of Science and Technology (UNIST) Undergraduate Researcher - Artificial Intelligence Graduate School Jul 2025 — Current

Skills

Python
ML / DL
R
SQL
3.93/4.0
GPA
1
Paper
9
Projects

Resume

Resume Preview

Selected Work

Projects Overview

Machine Learning Deep Learning Graph Transformer Python PyTorch
Water Pipeline Leakage Classification
Comparative study of ML and Graph Transformer methods achieving 96.4% accuracy on real-world sensor data for water pipeline leak detection.
Hackathon LLM FastAPI Python Backend
Swift Grader
LLM-powered essay grading system built at HackFax 2026, reducing manual grading time by ~80% with rubric-based evaluation.
Data Analytics SQL Oracle Prediction
Premier League SQL Analytics
Normalized SQL database and advanced analytics on 2023–24 EPL statistics, identifying key performance indicators correlated with league rankings (r ≈ 0.6).
Data Visualization R ggplot2 EDA
Crime Rates in Washington, D.C.
Visualized 8-year crime trends across multiple offense categories and analyzed relationships with student population and housing prices.
Recommendation System SVD Collaborative Filtering Python
Anime Recommendation System
Scalable recommendation engine using collaborative filtering and SVD matrix factorization on a large sparse MyAnimeList dataset (RMSE ≈ 1.16).
ML Classification Random Forest Web Scraping Python
ESG Score Prediction
End-to-end ESG grade classification pipeline with web scraping, feature engineering, and Random Forest achieving >85% accuracy.
Predictive Analytics Logistic Regression Hypothesis Testing R
Heart Attack Risk Prediction
Logistic regression and hypothesis testing to predict heart attack risk, exploring lifestyle variable associations through EDA and permutation tests.
Linear Regression EDA Correlation Analysis Python
Movie Industry Analysis
Explored 40+ years of movie data with EDA, correlation analysis, and multiple linear regression to identify key predictors of box-office success.
N-Body Simulation Dark Matter NumPy Python
N-Body Simulation of the Triangulum Galaxy
2D N-body simulation comparing rotation curves with and without dark-matter halos, demonstrating the necessity of unseen mass.

Research

Publications & Talks

2025
Conference Paper
A Comparative Study of Machine Learning and Deep Learning Methods for Water Pipeline Leakage Classification
Wonjune Lee et al.
Korea Software Congress · Yeosu, Korea · December 2025

Get in Touch

Contact & Links

Feel free to reach out anytime
for new opportunities or collaboration.

I am open to research roles, internships,
full-time positions, and side projects.

Languages
Korean (Native) · English (Fluent)
Location
Fairfax, Virginia, USA