Yunho Oh
I am recruiting graduate students (Ph.D. preferred). Please contact me if you are interested in research on computer architecture.
I am an assistant professor in the School of Electrical Engineering at Korea University. I lead the Computer Architecture and System Software Lab (ComSys). My research interests focus on designing next-generation computing systems, including hardware and software architectures for energy-efficient datacenters, processor architectures (CPUs, GPUs, and neural network accelerators), in-storage processing, memory systems, and high-performance computing.
Prior to joining Korea University, I was an assistant professor in the Department of Electronic and Electrical Engineering at SungKyunKwan University (SKKU). From March 2019 to February 2021, I worked as a postdoctoral researcher in PARSA at EPFL (PI: Professor Babak Falsafi). I completed my Ph.D. in the School of Electrical and Electronic Engineering at Yonsei University (Advisor: Professor Won Woo Ro). From August 2016 to August 2017, I worked as a visiting graduate scholar at the University of Southern California (Faculty Sponsor: Professor Murali Annavaram). From 2011 to 2014, I worked as a software engineer at Mobile Communications Business, Samsung Electronics.
News
Aug. 2024: I will be serving as a program committee member at MLSys 2025.
June 2024: A paper titled "VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing" has been accepted at ICPP 2024.
Apr. 2024: I will be serving as an external review committee member at MICRO 2024.
Mar. 2024: A paper titled "SAVector: Vectored Systolic Arrays" has been accepted at IEEE Access.
Feb. 2024: A paper titled "Conflict-Aware Compiler for Hierarchical Register File on GPUs" has been accepted at the Journal of Systems Architecture.
Feb. 2024: I will be serving as a technical program committee member at the 17th IEEE International Symposium on Embedded Multicore/Manycore SoCs (MCSoC-2024).
Dec. 2023: A paper titled "Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs" has been accepted at IEEE Embedded Systems Letters.
Oct. 2023: I will be serving as an organization committee member at the Fall Conference of IEIE 2023.
Oct. 2023: I will be serving as an organization committee member at the 5th Workshop on Future and Challenges of Memory-Centric Computing.
Oct. 2023: I will be serving as a program committee member at MLSys 2024.
Oct. 2023: I will be serving as a program committee member at GPGPU 2024.
July 2023: I will be serving as a light program committee member at HPCA 2024.
June 2023: A paper titled "Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors" has been accepted at ICPP 2023.
Mar. 2023: Two papers titled "R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs" and "Imprecise Store Exceptions" have been accepted at ISCA 2023.
Oct. 2022: Two papers titled "AstriFlash: A Flash-Based System for Online Services" and "SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs" have been accepted at HPCA 2023.
Oct. 2022: A paper titled "Analyzing GCN Aggregation on GPU" has been accepted at IEEE Access.
Oct. 2022: A paper titled "GhostLeg: Selective Memory Coalescing for Secure GPU Architecture" has been accepted at IEEE Access.
Oct. 2022: A paper titled "Scale-Out Systolic Arrays" has been accepted at ACM Transactions on Architecture and Code Optimization.
Sep. 2022: I will be serving as a program committee member at MLSys 2023.
Sep. 2022: I have joined the School of Electrical Engineering and the School of Semiconductor Engineering at Korea University as an assistant professor.
Aug. 2022: I gave a talk at the AI-SRC Summer Camp at Seoul National University.
July 2022: A paper titled "TEA-RC: Thread Context-Aware Register Cache for GPUs" has been accepted at IEEE Access.
Apr. 2022: I will be serving as an external review committee member at MICRO 2022.
Apr. 2022: I will be serving as a local arrangement chair at HPCA 2022.
Mar. 2022: A paper titled "CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs" has been accepted at IEEE Embedded Systems Letters.
Mar. 2022: I will be serving as a program committee member at the ACM Student Research Competition (ACM SRC) Grand Finals.
Mar. 2022: I will be serving as a session chair (Session 2A: GPU and Data Analytics) at ASPLOS 2022.
Feb. 2022: A paper titled "FLIXR: Embedding Index into Flash Translation Layer in SSDs" has been accepted at IEEE Transactions on Computers.
Feb. 2022: I will be serving as a tutorial chair at ICEIC 2022.
Jan. 2022: I gave a talk at Seoul National University.
July 2021: I will be serving as a program committee member at MLSys 2022.
June 2021: I will be serving as the poster session chair at ASPLOS 2022.
Mar. 2021: A paper titled "Rebooting Virtual Memory with Midgard" has been accepted at ISCA 2021. Big congrats, Sid!
Mar. 2021: I will be serving as an external review committee member at MICRO 2021.
Education
Aug. 2018
Ph.D. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea
Feb. 2011
M.S. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea
Feb. 2009
B.S. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea
Publications
Conferences and Journals
ICPP 2024
VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
Jaebeom Jeon, Minseong Gil, Junsu Kim, Jaeyong Park, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
The 53rd International Conference on Parallel Processing (ICPP 2024), Gotland, Sweden, 2024.
IEEE Access
SAVector: Vectored Systolic Arrays
Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon, and Yunho Oh
accepted at IEEE Access.
JSA
Conflict-Aware Compiler for Hierarchical Register File on GPUs
Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
accepted at Journal of Systems Architecture (*co-corresponding authors).
IEEE ESL
Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs
Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
accepted at IEEE Embedded Systems Letters.
MICRO 2023
MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
Seunghwan Sung, Sujin Hur, Sungwoo Kim, Dongho Ha, Yunho Oh, and Won Woo Ro
The 56th IEEE/ACM International Symposium on Microarchitecture (MICRO 2023), Toronto, Canada, 2023.
ICPP 2023 (Best Paper Award)
Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors
Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, 2023.
ISCA 2023
R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
Dongho Ha, Yunho Oh, and Won Woo Ro
The 50th ACM/IEEE International Symposium on Computer Architecture (ISCA-50), Orlando, Florida, 2023.
ISCA 2023
Imprecise Store Exceptions
Siddharth Gupta, Yuanlong Li, Qingxuan Kang, Abhishek Bhattacharjee, Babak Falsafi, Yunho Oh, and Mathias Payer
The 50th ACM/IEEE International Symposium on Computer Architecture (ISCA-50), Orlando, Florida, 2023.
HPCA 2023
AstriFlash: A Flash-Based System for Online Services
Siddharth Gupta, Yunho Oh, Lei Yan, Mark Sutherland, Abhishek Bhattacharjee, Babak Falsafi, and Peter Hsu
The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, Quebec, Canada, 2023.
HPCA 2023
SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
Jiwon Lee, Ju Min Lee, Yunho Oh, William Song, and Won Woo Ro
The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, Quebec, Canada, 2023.
ACM TACO
Scale-Out Systolic Arrays
Ahmet Caner Yüzügüler, Canberk Sönmez, Mario Drumond, Yunho Oh, Babak Falsafi, and Pascal Frossard
ACM Transactions on Architecture and Code Optimization, vol. 20, issue 2, no. 27, pp. 1-25, Mar. 2023.
IEEE TC
FLIXR: Embedding Index into Flash Translation Layer in SSDs
Gunjae Koo*, Yunho Oh*, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
IEEE Transactions on Computers (*equally contributed as the first authors), vol. 72, no. 3, pp. 250-263, Jan. 2023.
IEEE ESL
CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs
Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
IEEE Embedded Systems Letters, vol. 14, pp. 187-190, Dec. 2022.
IEEE Access
Analyzing GCN Aggregation on GPU
Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, vol. 10, pp. 113046-113060, Nov. 2022.
IEEE Access
GhostLeg: Selective Memory Coalescing for Secure GPU Architecture
Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
IEEE Access, vol. 10, pp. 111449-111462, Nov. 2022.
IEEE Access
TEA-RC: Thread Context-Aware Register Cache for GPUs
Ipoom Jeong, Yunho Oh, Won Woo Ro, and Myung Kuk Yoon
IEEE Access, vol. 10, pp. 82049-82062, Aug. 2022.
ISCA 2021
Rebooting Virtual Memory with Midgard
Siddharth Gupta, Atri Bhattacharyya, Yunho Oh, Abhishek Bhattacharjee, Babak Falsafi, and Mathias Payer
The 48th ACM/IEEE International Symposium on Computer Architecture, virtual conference, 2021.
MICRO 2020
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores
Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
The 53rd IEEE/ACM International Symposium on Microarchitecture, virtual conference, 2020.
ISCA 2019
Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs
Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
The 46th ACM/IEEE International Symposium on Computer Architecture, Phoenix, AZ, USA, 2019.
IEEE TC
Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro
IEEE Transactions on Computers, vol. 68, no. 4, pp. 609-616, Apr. 2019.
MICRO 2018
FineReg: Augmenting GPU Throughput via Fine-Grained Register File Management
Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro
The 51st IEEE/ACM International Symposium on Microarchitecture, Fukuoka, Japan, 2018.
IEEE TC
WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs
Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro
IEEE Transactions on Computers, vol. 67, no. 9, pp. 1366-1373, Sept. 2018.
IEEE TPDS
Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
IEEE Transactions on Parallel and Distributed Systems, vol. 28, no. 11, pp. 3142-3156, Nov. 2017.
ISCA 2017
Access Pattern-Aware Cache Management for Improving Data Utilization in GPU
Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram
The 44th ACM/IEEE International Symposium on Computer Architecture, Toronto, ON, Canada, 2017.
ISCA 2016
APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs
Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram
The 43rd ACM/IEEE International Symposium on Computer Architecture, Seoul, Korea, 2016.
ISPASS 2015
DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU
Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro
The 2015 IEEE International Symposium on Performance Analysis of Systems and Software, Philadelphia, PA, USA, 2015.
ITC-CSCC 2015
Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs
Yunho Oh, Jong Hyun Park, and Won Woo Ro
The 30th International Technical Conference on Circuits/Systems, Computers and Communications, Seoul, Korea, 2015.
KIISE
Introduction to Researches on Performance Bottlenecks of Many-Core GPU Architectures
Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, and Won Woo Ro
Communications of KIISE, Vol. 32 No. 5, May, 2014.
IJPP
GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table
Yunho Oh, Doohwan Oh, and Won W. Ro
International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, August, 2013.
ICEIC 2010
Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors
Yun H. Oh, and Won W. Ro
The 10th International Conference on Electronics, Information, and Communication, Cebu, Philippines, 2010.
ISMRM 2010
Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging
Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro
International Society for Magnetic Resonance in Medicine Annual Meeting, Stockholm, Sweden, 2010.
ELEX
Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard
Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro
IEICE Electronics Express, Vol. 7, No. 6, pp. 440-446, March, 2010.
Workshops
MLArchSys
Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating Point for DNN Training
Simla Burcu Harma, Ayan Chakraborty, Babak Falsafi, Martin Jaggi, and Yunho Oh
ML for Computer Architecture and Systems co-located with ISCA 2023.
SPMA
AstriFlash: An Online Flash-Based Memory Hierarchy
Siddharth Gupta, Yunho Oh, Lei Yan, Mark Sutherland, Abhishek Bhattacharjee, Babak Falsafi, and Peter Hsu
The 10th Workshop on Systems for Post-Moore Architectures co-located with EuroSys 2020.
HENND
Accelerating Neural Network with Selective Thread-Level Parallelism Regulation and Cache Bypassing on GPUs
Kwanghee Chang, Yunho Oh, Myung Kuk Yoon, and Won Woo Ro
International Workshop on Highly Efficient Neural Networks Design in conjunction with ESWEEK 2017.
Patent
Operation device of convolutional neural network, operation method of convolutional neural network and computer program stored in a recording medium to execute the method thereof
William Jinho Song, Won Woo Ro, Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, and Bogil Kim
US Patent 20220391320A1.
Central processing unit, GPU simulation method thereof, and computing system including the same
Won Woo Ro, Karam Park, Yunho Oh, Sangpil Lee, and Minwoo Kim
US Patent 9,378,533.
Experience
Professional Experience
Sep. 2022 - Current  Assistant Professor, Korea University  
Feb. 2021 - Aug. 2022  Assistant Professor, SungKyunKwan University  
Mar. 2019 - Feb. 2021  Postdoctoral Researcher, École Polytechnique Fédérale de Lausanne  
Principal Investigator: Prof. Babak Falsafi
- New memory system architecture for cost-efficient datacenters
- Microarchitecture-specific server workload behavior analysis
Sep. 2018 - Jan. 2019  Postdoctoral Researcher, Yonsei University  
- New register file architecture for GPUs with new memory technology
- Energy-efficient neural network processing on GPUs
- Entropy analysis on GPU register compression
Mar. 2014 - Aug. 2018  Graduate Research Assistant (Ph.D.), Yonsei University  
- Memory locality-aware warp scheduling technique on GPUs
- Efficient prefetching techniques on GPUs
- Efficient register file management on GPUs
- Energy-efficient memory system for GPUs
- Energy-efficient multi-core processor architecture
Aug. 2016 - Aug. 2017  Visiting Graduate Scholar, University of Southern California, LA, CA  
Faculty Sponsor: Prof. Murali Annavaram
- Efficient cache management technique on GPUs
- In-SSD database index management system
Feb. 2011 - Jan. 2014  Software Engineer, Mobile Communications Business, Samsung Electronics  
- Developed mobile applications for Android smartphones
- Developed a cloud service framework for wearable devices
Mar. 2009 - Feb. 2011  Graduate Research Assistant (M.S.), Yonsei University  
- Parallel genome sequence matching algorithm for GPUs
- High-performance medical image construction algorithms
- OpenVG accelerator prototype design
Teaching Experience
At Korea University Sep. 2022 ~ Current.
System Software
Computer Systems
Microprocessors
Digital Systems
Object-Oriented Programming
Engineering Mathematics I
At Sungkyunkwan University Feb. 2021 ~ Aug. 2022.
Logic Design Laboratory
Research Projects for Excellent Undergraduates
Embedded System Designs
At EPFL Mar. 2019 ~ Feb. 2021.
Topics on ML Systems (CS723)
Topics on Datacenter Design (CS728)
Semester Project
Introduction to Multiprocessor Architecture (CS307)
At Yonsei University
GPGPU Programming Practice (Samsung Advanced Technology Training Institute)
Computer Architecture (TA)
Electrical and Electronic Engineering Experiments (TA)
Talks
Oct. 2023.   Designing Scale-Out Systems in Post-Moore Era
SK Hynix Design Summit
Oct. 2023.   Numerical Encoding and DNN Training in Post-Moore Era
ICCE-Asia
Sep. 2023.   SK Semiconductor Education Program
SK Group
Aug. 2022.   Improving Energy Efficiency of AI Acceleration Systems
AISRC, Seoul National University
Jan. 2022.   Designing Scale-Out Data-Centric Systems: Virtual Memory and Accelerators
Seoul National University (Virtual)
Aug. 2021.   Data-Centric Computing, Brick by Brick
2021 SIGARCH Korea Workshop (Virtual)
Aug. 2021.   Towards Energy-Efficient DNN Acceleration Systems
2021 Summer School, Korean Society of Artificial Intelligence in Medicine (Virtual)
June 2021.   Challenges in Next-Generation Datacenters: Accelerators and Memory Systems
IT21 Conference (Virtual)
June 2020.   Designing Workload-Aware GPU Architectures 
HiSilicon (Remote talk)
Sep. 2019.   Resolving Critical GPU Performance Bottlenecks 
Hongik University, Seoul, Korea
Nov. 2018.   Rethinking GPU Execution Model 
EPFL, Lausanne, VD, Switzerland
Oct. 2018.   Towards Extreme Throughput 
EPFL, Lausanne, VD, Switzerland
Dec. 2018.   FineReg: Augmenting GPU Throughput via Fine-Grained Register File Management 
Invited Talk in Software Congress 2018, PyeongChang, Korea
Services
Organization Committee
Fall Conference of IEIE, 2023
5th Workshop on Future and Challenges of Memory-Centric Computing, 2023
ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2022
Program Committee
IEEE International Symposium on Embedded Multicore/Manycore SoCs (MCSoC-2024)
Conference on Systems and Machine Learning (MLSys), 2024
Workshop on General Purpose Processing using GPUs (GPGPU), 2024
IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2024 (Light PC)
Conference on Systems and Machine Learning (MLSys), 2023
ACM Student Research Competition (SRC), 2022
Conference on Systems and Machine Learning (MLSys), 2022
Workshop on the Computing Continuum (in HiPEAC), 2020
External Review Committee
IEEE/ACM International Symposium on Microarchitecture (MICRO), 2024
IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022
IEEE/ACM International Symposium on Microarchitecture (MICRO), 2021
IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020
ACM/IEEE International Symposium on Computer Architecture (ISCA), 2019
Session Chair
Session 2A: GPU and Data Analytics: ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
Reviewer
IEEE Transactions on Very Large Scale Integration Systems, 2024
IEEE Transactions on Computers, 2024
IEEE Transactions on Parallel and Distributed Systems, 2024
ACM Transactions on Architecture and Code Optimization, 2024
IEEE Computer Architecture Letters, 2024
IEEE Computer Architecture Letters, 2023
Parallel Computing, 2023
ACM Transactions on Design Automation of Electronic Systems, 2022
IEEE Computer Architecture Letters, 2022
IEEE Transactions on Emerging Topics in Computing, 2021
IEEE Computer Architecture Letters, 2021
Future Generation Computer Systems, 2020
IEEE Computer Architecture Letters, 2019
IEEE Access, 2019
Tutorial Chair
International Conference on Electronics, Information, and Communication (ICEIC), 2022
Honors and Awards
Sep. 2023.   Excellent Lecture Award  
Korea University
Mar. 2023.   Excellent Lecture Award  
Korea University
Nov. 2017.   Excellent Graduate Researcher Scholarship 
Yonsei University
Aug. 2016 - Aug. 2017.   BK21 Long-Term Overseas Research Fellowship 
Yonsei University
Mar. 2009 - Feb. 2011.   Mobile Communication Graduate Fellowship 
Samsung Electronics
Feb. 2008.   Excellent Undergraduate Awarded Honors and Scholarship 
Yonsei University
Contact
E-mail   [firstname]_[lastname] at korea dot ac dot kr
LinkedIn   https://www.linkedin.com/in/yunho-oh-comarch/