Young-suk Lee

Assistant Professor

Korea Advanced Institute of Science and Technology

I am an Assistant Professor in the Department of Bio and Brain Engineering and the Graduate School of Engineering Biology at KAIST. Previously, I had the opportunity to work very closely with molecular biologists in V. Narry Kim’s lab at Seoul National University. In 2016, I received my Ph.D. in Computer Science from Princeton University, working with Olga G. Troyanskaya. As an undergraduate, I majored in both Computer Science and Mathematics at the University of Texas at Austin, where I was first introduced to computational biology by Tandy Warnow.

Please read the Featured Spotlight for more about my journey as a computational biologist, my advice to undergraduates and graduate students, and why I stayed in academia.

If you are interested in working with me, please feel free to contact me with a brief overview of your background and research interests.

Interests

Bioinformatics
Functional genomics
Molecular biology
Probabilistic modeling
Single-nucleotide analysis

Education

PhD in Computer Science, 2016
Princeton University

BSc in Computer Science, 2010
The University of Texas at Austin

BSc in Mathematics, 2010
The University of Texas at Austin

Teaching

I never teach my pupils; I only attempt to provide the conditions in which they can learn.
– Albert Einstein

The advent of massive open online courses has changed the way we look at education and challenges traditional views on the role of instructors. Simple transfer of knowledge is no longer the rate-limiting step in educating the next generation. Instead, knowledge is now accessible to anyone with a computer, tablet, or mobile phone connected to the internet. I’ve benefited tremendously from these initiatives myself, but they have also forced me to reevaluate my pedagogical values. This led me to my three foundations of instruction and mentorship: construction, selection, and interaction. All three form the basis of the courses listed below.

Spring 2025: BiS801 AI Fundamentals in Biology and Health Technology
Spring 2025: BiS437 Bio-Data Engineering
Fall 2024: EB502 Programming for Engineering Biology
Fall 2024: BiS232 Bio-Data Structures
Spring 2024: BiS437 Bio-Data Engineering
Fall 2023: EB502 Programming for Engineering Biology
Fall 2023: BiS232 Bio-Data Structures
Spring 2023: BiS437 Bio-Data Engineering
Fall 2022: BiS232 Bio-Data Structures
Spring 2022: BiS800 Methods in Functional Genomics and Computational Molecular Biology
Fall 2021: BiS232 Bio-Data Structures
- Lecture 1 Introduction
- Lecture 3 Foundations of computational thinking
Spring 2021: BiS800 Methods in Functional Genomics and Computational Molecular Biology
- Lecture 1 Introduction
- Lecture 3 Data Science I

Research

The science of today is the technology of tomorrow.
– Barbara McClintock

Biology is not random, just largely unknown. There is an almost infinite number of possible interactions, yet only a sparse handful constitute a complex living system. To narrow down this vast search space, massive amounts of biological data are being generated to capture snapshots or snippets of the functional genome, multicellular heterogeneity, and complex human diseases. In this effort, bioinformatics algorithms play a key role in interpreting these large data collections and elucidating the underlying principles, at both the molecular and system levels.

The Young Laboratory at KAIST draws upon ideas from data science, applied statistics, and machine learning to tackle fundamental questions in quantitative biology. We incorporate problem-specific knowledge into the behavior of our algorithms to address the challenge of underspecification in modern machine learning methods. One of our primary objectives is to complete the human gene regulatory network. Specifically, we aim to map the missing regulatory axes of functional RNAs in terms of RNA modification, RNA structure, and protein-RNA interaction.

Projects

An RNA perspective of functional genomics

Only 2% of the human genome consists of protein-coding genes. The remaining 98% is non-coding and thought to encode the regulatory information for gene expression. Interpreting this non-coding region is thus key to understanding the functional genome and its implications for complex diseases. To tackle this, we take advantage of biological data generated from breakthroughs in chemical biology and bioengineering such as short- and long-read sequencing, oligosynthesis, chemical probing, and click chemistry.

In particular, we focus on elements of the genome that are transcribed into functional RNAs. Advances in biochemical and high-throughput techniques provide strong evidence that 74.7% of the human genome undergoes transcription, thus highlighting the importance of RNA research in functional genomics. The technology-specific computational tools built in our lab offer the means towards integrative genomics and functional interpretation. Our goal is to achieve this at single-nucleotide resolution across transcription, processing, modification, translation, decay, and other stages of the RNA life cycle.

No free lunch for emerging high-throughput technologies

It’s an exciting time to work in modern biology and bioengineering. Innovations in high-throughput techniques such as single-cell sequencing and spatial transcriptomics provide the means to extract deeper molecular insights into organismal development, immunology, and cancer biology. Existing computational models are being challenged and improved, bringing us closer to unraveling the complexity of biological systems.

The algorithmic task here is to address inherent computational and statistical challenges in handling data generated from each high-throughput technology. This could be anything from applying the right data normalization to correcting batch effects for data integration. The key is to incorporate biology-specific knowledge into the design of computational tools, statistical models, and neural architectures. In light of this, our lab is committed to the development of these tailored methods, which we then use to extract quantitative principles underlying the biological data.

Combinatorial optimization in translational bioengineering

RNA therapeutics, genome editing, and artificial organoids are just a few examples of biological engineering that are changing the way we approach human biology. However, these endeavors are often combinatorial optimization problems: the potential is near-infinite, but the search is intractable for brute-force algorithms. In RNA engineering, for example, there are more than 10⁶⁰ possible 100-nucleotide sequences with varying degrees of functionality. To put this into perspective, the estimated number of atoms on Earth is only about 10⁵⁰. This simple mathematical exercise shows the limits of relying solely on high-throughput screening for sequence design and optimization.
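
The arithmetic behind this comparison can be checked in a few lines of Python. This is only an illustrative sketch: it assumes a four-letter RNA alphabet (A, C, G, U), takes the atom count quoted above at face value, and uses a screening throughput of one billion sequences per day, a hypothetical figure chosen purely for illustration and not taken from our work.

```python
import math

ALPHABET_SIZE = 4            # A, C, G, U
SEQUENCE_LENGTH = 100        # nucleotides, as in the text
ATOMS_ON_EARTH = 10 ** 50    # rough estimate quoted in the text
SCREENS_PER_DAY = 10 ** 9    # hypothetical throughput, for illustration only


def sequence_space_size(length: int, alphabet_size: int = ALPHABET_SIZE) -> int:
    """Number of distinct sequences of the given length."""
    return alphabet_size ** length


n_sequences = sequence_space_size(SEQUENCE_LENGTH)

# Order of magnitude of the search space (4^100 = 2^200).
order_of_magnitude = math.floor(math.log10(n_sequences))

# How many candidate sequences exist per atom on Earth.
sequences_per_atom = n_sequences // ATOMS_ON_EARTH

# Years needed to screen every sequence at the assumed throughput.
years_to_screen = n_sequences / (SCREENS_PER_DAY * 365)

print(f"Possible sequences: about {n_sequences:.2e}")
print(f"Order of magnitude: 10^{order_of_magnitude}")
print(f"Sequences per atom on Earth: about {sequences_per_atom:.1e}")
print(f"Years to screen exhaustively: about {years_to_screen:.1e}")
```

Even under this generous throughput assumption, exhaustive screening is out of reach by dozens of orders of magnitude, which is why the search has to be guided rather than enumerated.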

Our approach involves building powerful search algorithms and intelligent systems to navigate this vast combinatorial space. Inspired by works in translational medicine, we leverage meaningful insights and principles from molecular biology and functional genomics for the computational optimization of bioengineering. We are interested in accelerating a wide range of applications including next-generation molecular devices for diagnosis and automatic systems for synthetic biology.

Team

We may have all come on different ships, but we’re in the same boat now.
– Martin Luther King, Jr.

Graduate students

Sooyoung Ko (Spring 2025)
- BSc in Bio and Brain Engineering @ KAIST
- BSc in Biological Sciences @ KAIST
- kos0889 “at” kaist.ac.kr
Juhee Lee (Spring 2025)
- BSc in Molecular Biology @ Pusan National University
- BSc in Statistics @ Pusan National University
- jhlee0 “at” kaist.ac.kr
Sungchul Yang (Spring 2025)
- BSc in Biotechnology @ Yonsei University
- BSc in Applied Statistics @ Yonsei University
- BSc in Mathematics @ Yonsei University
- pacific333 “at” kaist.ac.kr
Melissa Llaiqui-Condori (Fall 2024)
- MS in Medical Science and Engineering @ KAIST
- Medical Doctor @ Catholic University of Santa María
- mel.llc “at” kaist.ac.kr
Sunghyun Ha (Spring 2024)
- BSc in Statistics @ Korea University
- BSc in Life Science @ Korea University
- ha6411 “at” kaist.ac.kr
Michelle A. Sunartha (Fall 2023)
- BSc in Computer Science @ TaiwanTech
- michellesoen “at” kaist.ac.kr
Daniil Melnichenko (Fall 2023)
- BSc in Bio and Brain Engineering @ KAIST
- BSc in Chemistry @ KAIST
- dan.mb “at” kaist.ac.kr
Hyunju Kim (Fall 2023)
- BSc in Systems Biology @ Konkuk University
- hyunjukim “at” kaist.ac.kr
Juhyeon Kim (Spring 2023)
- MS in Bio and Brain Engineering @ KAIST
- BSc in Undergraduate Studies @ DGIST
- axdrgn “at” kaist.ac.kr
Suyeon Lee (Spring 2023)
- MS in Bio and Brain Engineering @ KAIST
- BSc in Systems Biomedical Science @ Soongsil University
- susu1010 “at” kaist.ac.kr
Hyeonjung Lee (Fall 2022)
- MS in Life Sciences @ POSTECH
- BSc in Biomedical Science @ UNIST
- BSc in Bioengineering @ UNIST
- hyeon “at” kaist.ac.kr
Hyeonggon Cho (Fall 2021)
- MS in Bio and Brain Engineering @ KAIST
- BSc in Pharmacy @ Seoul National University
- gudrhs6576 “at” kaist.ac.kr
Jongmin Lim (Fall 2021)
- MS in Bio and Brain Engineering @ KAIST
- BSc in Molecular Genetics @ University of Toronto
- jmlim2 “at” kaist.ac.kr

Administrative Assistant

Jae-bok Cho (2023 - )
- jbcho “at” kaist.ac.kr

Undergraduates

박유진 (Winter 2024) Bio and Brain Engineering @ KAIST
신국희 (Winter 2024) Freshman @ KAIST
Nel Zandre (Winter 2024) Bio and Brain Engineering @ KAIST
백하은 (Summer 2024) Biomedical Engineering @ UNIST
민채원 (Summer 2024) Biomedical Science @ Catholic University
오인혁 (Summer 2024) Bio and Brain Engineering @ KAIST
김주현 (Summer 2024) Computer Science @ KAIST
Baktynur Azhybaev (Winter 2023) Bio and Brain Engineering @ KAIST
양성철 (Winter 2023) Biotechnology @ Yonsei University
성달경 (Winter 2023) Biological Sciences @ Seoul National University
하성현 (Summer 2023) Life Science @ Korea University
Aleksandra J. Wisniewska (Summer 2023) Bio and Brain Engineering @ KAIST
김민정 (Winter 2022) Genetic Engineering @ Kyung Hee University
Shubhangi Kumar (Winter 2022) Computer Science @ KAIST
안규찬 (Winter 2022) Bio and Brain Engineering @ KAIST
Daniil Melnichenko (Summer 2022) Chemistry @ KAIST
김동근 (Summer 2022) Computer Science @ KAIST
김대원 (Summer 2022) Computer Science @ KAIST
김근희 (Spring 2022) Biological Sciences @ KAIST
김민주 (Spring 2022) Bio and Brain Engineering @ KAIST
박해준 (Winter 2022) Computer Science @ KAIST
Azamat Armanuly (Fall 2021) Biological Sciences @ KAIST
Benedict Fabia (Fall 2021) Freshman @ KAIST
정다현 (Summer 2021) Biological Sciences @ KAIST
이지현 (Summer 2021) Computer Science @ KAIST

Working

*equal contributions #corresponding author

The essence of strategy is choosing what not to do.
– Michael Porter

Designing 5′ UTR sequences improves the capacity of mRNA therapeutics in preclinical models of aging and obesity

Submitted

S Yoon*, H Cho*, J Lee, S Ha, S Lee, YS Lee, Y Cho, D Ha, A Oh, S Lee, JH Nam#, YS Lee#

Near-optimal variant calling by pseudo-database construction

Under Review

H Lee*, S Kim*, MA Sunartha, CY Lee#, YS Lee#

Packaging signal of SARS-CoV-2

Under Review

Y Park, J Lim, H Cho, A Son, CH Li, VN Kim#, YS Lee#

Publications

*equal contributions #corresponding author

Lee YS*#, Levdansky Y*, Jung Y, Kim VN#, Valkov E# (2024). Deadenylation kinetics of mixed poly(A) tails at single-nucleotide resolution. Nature Structural & Molecular Biology.

Ku J*, Lee K*, Ku D, Kim S, Lee J, Bang H, Kim N, Do H, Lee H, Lim C, Han J, Lee YS#, Kim Y# (2024). Alternative polyadenylation determines the functional landscape of inverted Alu repeats. Molecular Cell.

Fabia B, Kim M, Lim J, Lee YS# (2023). Mathematical Modeling of mRNA Poly(A) Tail Shortening Process. Deadenylation: Methods and Protocols.

Kim JY*, Jeon K*, Hong JJ*, Park SI*, Cho H*, Park HJ*, Kwak HW, Park HJ, Bang YJ, Lee YS, Bae SH, Kim SH, Hwang KA, Jung DI, Cho SH, Seo SH, Kim G, Oh H, Lee HY, Kim KH, Lim HY, Jeon P, Lee JY, Chung J, Lee SM, Ko HL, Song M, Cho NH#, Lee YS#, Hong SH#, Nam JH# (2023). Heterologous vaccination utilizing viral vector and protein platforms confers complete protection against SFTSV. Scientific Reports.

Park J, Kim M, Yi H, Baeg K, Choi Y, Lee YS, Lim J, Kim VN# (2023). Short poly(A) tails are protected from deadenylation by the LARP1–PABP complex. Nature Structural & Molecular Biology.

Coffeehouse

I confess I do not know why, but looking at the stars always makes me dream.
– Vincent Van Gogh

Here is a list of places where I like to get a particular cup of coffee.

Daejeon, South Korea
- Coffee Office(커피오피스) : Hand Drip
- Intradorn(인트라던) : Adorn coffee
- Voila Café(브알라) : Sea Salt Americano
KAIST
- Café Dream : Ice Americano + extra shot
- 흥커피로스터스 : Latte + Scone
- Coffee1011(커피1011로스팅랩) : Americano
Seoul National University
- Gabean Coffee Roasters : Pour-Over (+refill)
- A Twosome Place : Long black + extra shot
- Hollys Coffee : Cold brew
Princeton, NJ
- Small World Coffee : A robust cup of Joe
- Starbucks : Espresso from their Clover machine

Contact

youngl@kaist.ac.kr
+82-42-350-7924
1113 CMS(E16), 291 Daehak-ro, Yuseong-gu, Daejeon, 34141, South Korea
Enter Building E16 and take the elevator to Office 1113 on Floor 11