Yunho Oh
I am recruiting Ph.D. students. Please contact me if you are interested in research on computer architecture.
I am an associate professor in the School of Electrical Engineering at Korea University. I lead the Computer Architecture and System Software Lab (ComSys). My research interests focus on designing next-generation computing systems, including hardware and software architectures for energy-efficient datacenters, processor architectures (CPUs, GPUs, and neural network accelerators), in-storage processing, memory systems, and high-performance computing.
Prior to joining Korea University, I was an assistant professor in the Department of Electronic and Electrical Engineering at SungKyunKwan University (SKKU). From March 2019 to February 2021, I worked as a postdoctoral researcher in PARSA at EPFL (PI: Professor Babak Falsafi). I completed my Ph.D. in the School of Electrical and Electronic Engineering at Yonsei University (Advisor: Professor Won Woo Ro). From August 2016 to August 2017, I worked as a visiting graduate scholar at the University of Southern California (Faculty Sponsor: Professor Murali Annavaram). From 2011 to 2014, I worked as a software engineer at Mobile Communications Business, Samsung Electronics.
News
Sep. 2025: I will be serving as an organization committee member at ISPASS 2026.
Sep. 2025: I will be serving as a program committee member at ISPASS 2026.
Sep. 2025: I have been promoted to Associate Professor at Korea University.
Aug. 2025: A paper titled "HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks" has been accepted at IISWC 2025.
Aug. 2025: A paper titled "FINEA: An Efficient Neural Network Accelerator Exploiting Factorized Input Features" has been accepted at ICCD 2025.
July 2025: I will be serving as a TPC co-chair at ICEIC 2026.
July 2025: I will be serving as a program committee member at HPCA 2026.
June 2025: I will be serving as a session chair at LCTES 2025.
June 2025: A paper titled "MOST: Memory Oversubscription-aware Scheduling for Tensor Migration on GPU Unified Storage" has been accepted at IEEE CAL.
Apr. 2025: Two papers titled "SSFFT: Energy-Efficient Selective Scaling for Fast Fourier Transform in Embedded GPUs" and "Kubism: Disassembling and Reassembling K-Means Clustering for Mobile Heterogeneous Platforms" have been accepted at LCTES 2025.
Mar. 2025: A paper titled "Avant-Garde: Empowering GPUs with Scaled Numeric Formats" has been accepted at ISCA 2025.
Mar. 2025: A paper titled "Hierarchical Traversal Stack Design Using Shared Memory for GPU Ray Tracing" has been accepted at ISPASS 2025.
Feb. 2025: I will be serving as a program committee member at MICRO 2025.
Feb. 2025: I will be serving as an organization committee member at MICRO 2025.
Feb. 2025: A paper titled "Beyond VABlock: Improving Transformer Workloads through Aggressive Prefetching" has been accepted at JSA.
Feb. 2025: A paper titled "TM-Training: An Energy-Efficient Tiered Memory System for Deep Learning Training in NPUs" has been accepted at ACM TOS.
Feb. 2025: A paper titled "An Entropy Model for GPU Register Compression" has been accepted at Journal of Semiconductor Technology and Science.
Jan. 2025: A paper titled "Effective Interplay between Sparsity and Quantization: From Theory to Practice" has been accepted at ICLR (Spotlight).
Jan. 2025: A paper titled "A Low-latency On-chip Cache Hierarchy for Load-to-use Stall Reduction in GPUs" has been accepted at ACM TACO.
Dec. 2024: I will be serving as a program committee member at ISPASS 2025.
Nov. 2024: A paper titled "HyMM: A Hybrid Sparse-Dense Matrix Multiplication Accelerator for GCNs" has been accepted at DATE 2025.
Nov. 2024: A paper titled "Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput" has been accepted at HPCA 2025.
Oct. 2024: A paper titled "TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs" has been accepted at IEEE Embedded Systems Letters.
Aug. 2024: I will be serving as a program committee member at MLSys 2025.
June 2024: A paper titled "VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing" has been accepted at ICPP 2024.
Apr. 2024: I will be serving as an external review committee member at MICRO 2024.
Mar. 2024: A paper titled "SAVector: Vectored Systolic Arrays" has been accepted at IEEE Access.
Feb. 2024: A paper titled "Conflict-Aware Compiler for Hierarchical Register File on GPUs" has been accepted at Journal of Systems Architecture.
Feb. 2024: I will be serving as a technical program committee member at the 17th IEEE International Symposium on Embedded Multicore/Manycore SoCs (MCSoC-2024).
Dec. 2023: A paper titled "Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs" has been accepted at IEEE Embedded Systems Letters.
Oct. 2023: I will be serving as an organization committee member at the Fall Conference of IEIE 2023.
Oct. 2023: I will be serving as an organization committee member at the 5th Workshop on Future and Challenges of Memory-Centric Computing.
Oct. 2023: I will be serving as a program committee member at MLSys 2024.
Oct. 2023: I will be serving as a program committee member at GPGPU 2024.
July 2023: I will be serving as a light program committee member at HPCA 2024.
June 2023: A paper titled "Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors" has been accepted at ICPP 2023.
Mar. 2023: Two papers titled "R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs" and "Imprecise Store Exceptions" have been accepted at ISCA 2023.
Oct. 2022: Two papers titled "AstriFlash: A Flash-Based System for Online Services" and "SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs" have been accepted at HPCA 2023.
Oct. 2022: A paper titled "Analyzing GCN Aggregation on GPU" has been accepted at IEEE Access.
Oct. 2022: A paper titled "GhostLeg: Selective Memory Coalescing for Secure GPU Architecture" has been accepted at IEEE Access.
Oct. 2022: A paper titled "Scale-Out Systolic Arrays" has been accepted at ACM Transactions on Architecture and Code Optimization.
Sep. 2022: I will be serving as a program committee member at MLSys 2023.
Sep. 2022: I have joined the School of Electrical Engineering and the School of Semiconductor Engineering at Korea University as an assistant professor.
Aug. 2022: I have given a talk at the AI-SRC Summer Camp at Seoul National University.
July 2022: A paper titled "TEA-RC: Thread Context-Aware Register Cache for GPUs" has been accepted at IEEE Access.
Apr. 2022: I will be serving as an external review committee member at MICRO 2022.
Apr. 2022: I will be serving as a local arrangement chair at HPCA 2022.
Mar. 2022: A paper titled "CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs" has been accepted at IEEE Embedded Systems Letters.
Mar. 2022: I will be serving as a program committee member at the ACM Student Research Competition (ACM SRC) Grand Finals.
Mar. 2022: I will be serving as a session chair (Session 2A: GPU and Data Analytics) at ASPLOS 2022.
Feb. 2022: A paper titled "FLIXR: Embedding Index into Flash Translation Layer in SSDs" has been accepted at IEEE Transactions on Computers.
Feb. 2022: I will be serving as a tutorial chair at ICEIC 2022.
Jan. 2022: I have given a talk at Seoul National University.
July 2021: I will be serving as a program committee member at MLSys 2022.
June 2021: I will be serving as the poster session chair at ASPLOS 2022.
Mar. 2021: A paper titled "Rebooting Virtual Memory with Midgard" has been accepted at ISCA 2021. Big congrats, Sid!
Mar. 2021: I will be serving as an external review committee member at MICRO 2021.
Education
Aug. 2018 
Ph.D. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea 
Feb. 2011 
M.S. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea 
Feb. 2009 
B.S. in the School of Electrical and Electronic Engineering, Yonsei University, Seoul, Korea 
Publications
Conferences and Journals
IISWC 2025 
                       			HALO: Hybrid Systolic Arrays via Logical Partitioning for Acceleration of Complex-Valued Neural Networks 
                        		 Ji Yeong Yi, Eunbi Jeong, SungHee Yum, Jane Rhee, Sangun Choi, Gunjae Koo, Yunho Oh, Myung Kuk Yoon
                        		Accepted at IEEE International Symposium on Workload Characterization (IISWC 2025), Irvine, California, USA, 2025.
                        		
ICCD 2025 
                       			FINEA: An Efficient Neural Network Accelerator Exploiting Factorized Input Features 
                        		 Yujin Kim, Chanhun Jeong, Yunho Oh, Myung Kuk Yoon and Gunjae Koo
                        		Accepted at IEEE International Conference on Computer Design (ICCD 2025), Dallas, USA, 2025.
                        		
IEEE CAL  
                             	MOST: Memory Oversubscription-aware Scheduling for Tensor Migration on GPU Unified Storage
                              	Junsu Kim, Jaebeom Jeon, Jaeyong Park, Sangun Choi, Minseong Gil, Seokin Hong, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
                              	Accepted at IEEE Computer Architecture Letters.
                             	
LCTES 2025  
                             	SSFFT: Energy-Efficient Selective Scaling for Fast Fourier Transform in Embedded GPUs
                              	Dongwon Yang, Jaebeom Jeon, Minseong Gil, Junsu Kim, Seondeok Kim, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
                              	The 26th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES 2025), Seoul, Korea, 2025.
                             	
LCTES 2025  
                              	Kubism: Disassembling and Reassembling K-Means Clustering for Mobile Heterogeneous Platforms
   
                              	Seondeok Kim*, Sangun Choi*, Jaebeom Jeon, Junsu Kim, Minseong Gil, Jaehyeok Ryu, and  Yunho Oh
                              	(*: Co-first authors)
                              	The 26th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES 2025), Seoul, Korea, 2025.
                             	
ISCA 2025  
                       			Avant-Garde: Empowering GPUs with Scaled Numeric Formats 
                        		Minseong Gil,  Dongho Ha, Simla Burcu Harma, Myung Kuk Yoon, Babak Falsafi, Won Woo Ro, and Yunho Oh
                        		The 52nd IEEE/ACM International Symposium on Computer Architecture (ISCA 2025).
                        		
ISPASS 2025 (Best Paper Honorable Mention)  
                       			Hierarchical Traversal Stack Design Using Shared Memory for GPU Ray Tracing 
                        		Eunsoo Jung, Eunbi Jeong, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
                        		Accepted at IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2025), (*co-corresponding authors).
                        		
JSA  
                       			Beyond VABlock: Improving Transformer Workloads through Aggressive Prefetching 
                        		Jane Rhee, Ikyoung Choi, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
                        		Accepted at Journal of Systems Architecture. (*co-corresponding authors).
                        		
ACM TOS   
		                        TM-Training: An Energy-Efficient Tiered Memory System for Deep Learning Training in NPUs 
		                        Jaeyong Park, Sangun Choi, Jongmin Kim, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
		                        Accepted at ACM Transactions on Storage.
		                        
ICLR 2025 (Spotlight)  
                              	Effective Interplay between Sparsity and Quantization: From Theory to Practice 
                              	Simla Burcu Harma, Ayan Chakraborty, Elizaveta Kostenok, Danila Mishin, Dongho Ha, Babak Falsafi, Martin Jaggi, Ming Liu, Yunho Oh, Suvinay Subramanian, and Amir Yazdanbakhsh
                              	Accepted at The Thirteenth International Conference on Learning Representations (ICLR 2025), Singapore, 2025.
                              	
ACM TACO  
		                        A Low-latency On-chip Cache Hierarchy for Load-to-use Stall Reduction in GPUs 
		                        Negin Mahani, Hajar Falahati, Sina Darabi, Ahmad Javadi-Nezhad, Yunho Oh, Mohammad Sadrosadati, Hamid Sarbazi-Azad, and Babak Falsafi
		                        Accepted at ACM Transactions on Architecture and Code Optimization.
		                        
HPCA 2025  
                              	Marching Page Walks: Batching and Concurrent Page Table Walks for Enhancing GPU Throughput 
                              	Jiwon Lee, Gun Ko, Myung Kuk Yoon, Ipoom Jeong, Yunho Oh,  and Won Woo Ro
                              	Accepted at The 31st IEEE International Symposium on High-Performance Computer Architecture (HPCA-31), Las Vegas, Nevada, USA, 2025.
                              	
DATE 2025  
                              	HyMM: A Hybrid Sparse-Dense Matrix Multiplication Accelerator for GCNs 
                              	Hunjong Lee, Jihun Lee, Jaewon Seo, Yunho Oh, Myungkuk Yoon and Gunjae Koo
                              	Accepted at Design, Automation and Test in Europe Conference (DATE 2025), Lyon, France, 2025.
                              	
IEEE ESL  
                              	TLP Balancer: Predictive Thread Allocation for Multi-Tenant Inference in Embedded GPUs 
                              	Minseong Gil, Jaebeom Jeon, Junsu Kim, Sangun Choi, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
                              	Accepted at IEEE Embedded Systems Letters.
                              	
IEEK JSTS  
                              	An Entropy Model for GPU Register Compression 
                              	Minsik Kim, Yunho Oh, and Won Woo Ro
                              	Accepted at Journal of Semiconductor Technology and Science.
                              	
ICPP 2024 
                              	VitBit: Enhancing Embedded GPU Performance for AI Workloads through Register Operand Packing
                              	Jaebeom Jeon,  Minseong Gil,  Junsu Kim, Jaeyong Park, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
                              	The 53rd International Conference on Parallel Processing (ICPP 2024), Gotland, Sweden, 2024.
                              	
IEEE Access  
	                            SAVector: Vectored Systolic Arrays 
	                            Sangun Choi, Seongjun Park, Jaeyong Park, Jongmin Kim, Gunjae Koo, Seokin Hong, Myung Kuk Yoon, and Yunho Oh
	                            IEEE Access, Vol. 12, pp. 44446-44461, 2024.
	                            
JSA  
                       			Conflict-Aware Compiler for Hierarchical Register File on GPUs 
                        		Eunbi Jeong, Eun Seong Park, Gunjae Koo, Yunho Oh*, and Myung Kuk Yoon*
                        		Journal of Systems Architecture, Volume 149, 2024, pp. 103099 (*co-corresponding authors).
                        		
IEEE ESL  
                       			Adaptive Kernel Merge and Fusion for Multi-Tenant Inference in Embedded GPUs 
                        		Jaebeom Jeon, Gunjae Koo, Myung Kuk Yoon, and Yunho Oh
                        		IEEE Embedded Systems Letters, vol. 16, no. 4, pp. 421-424, Dec. 2024.
                        		
MICRO 2023  
		                        MAD MAcce: Supporting Multiply-Add Operations for Democratizing Matrix-Multiplication Accelerator
		                        Seunghwan Sung,  Sujin Hur, Sungwoo Kim, Dongho Ha, Yunho Oh,  and Won Woo Ro
		                        The 56th IEEE/ACM International Symposium on Microarchitecture (MICRO 2023), Toronto, Canada, 2023.
		                        
ICPP 2023 (Best Paper Award) 
		                        Warped-MC: An Efficient Memory Controller Scheme for Massively Parallel Processors
		                        Jonghyun Jeong,  Yunho Oh,  Myung Kuk Yoon, and Gunjae Koo
		                        The 52nd International Conference on Parallel Processing (ICPP 2023), Salt Lake City, Utah, 2023.
		                        
ISCA 2023  
		                        R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs
		                        Dongho Ha, Yunho Oh,  and Won Woo Ro
		                        The 50th ACM/IEEE International Symposium on Computer Architecture (ISCA-50), Orlando, Florida, 2023.
		                        
ISCA 2023  
		                        Imprecise Store Exceptions
		                        Siddharth Gupta, Yuanlong Li, Qingxuan Kang, Abhishek Bhattacharjee, Babak Falsafi, Yunho Oh,  and Mathias Payer
		                        The 50th ACM/IEEE International Symposium on Computer Architecture (ISCA-50), Orlando, Florida, 2023.
		                        
HPCA 2023  
		                        AstriFlash: A Flash-Based System for Online Services
		                        Siddharth Gupta, Yunho Oh,  Lei Yan, Mark Sutherland, Abhishek Bhattacharjee, Babak Falsafi, and Peter Hsu
		                        The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, Quebec, Canada, 2023.
		                        
HPCA 2023  
		                        SnakeByte: A TLB Design with Adaptive and Recursive Page Merging in GPUs
		                        Jiwon Lee, Ju Min Lee, Yunho Oh,  William Song, and Won Woo Ro
		                        The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, Quebec, Canada, 2023.
		                        
ACM TACO  
		                        Scale-Out Systolic Arrays 
		                        Ahmet Caner Yüzügüler, Canberk Sönmez, Mario Drumond, Yunho Oh, Babak Falsafi, and Pascal Frossard
		                        ACM Transactions on Architecture and Code Optimization, vol. 20, issue 2, no. 27, pp. 1-25, Mar. 2023.
		                        
IEEE TC  
		                        FLIXR: Embedding Index into Flash Translation Layer in SSDs 
		                        Gunjae Koo*, Yunho Oh*, Hung-Wei Tseng, Won Woo Ro, and Murali Annavaram
		                        IEEE Transactions on Computers (*equally contributed as the first authors), vol. 72, no. 3, pp. 250-263, Jan. 2023.
		                        
IEEE ESL  
		                        CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs 
		                        Yunho Oh, Ipoom Jeong, Won Woo Ro, and Myung Kuk Yoon
		                        IEEE Embedded Systems Letters, vol. 14, pp. 187-190, Dec. 2022.
		                        
IEEE Access 
		                        Analyzing GCN Aggregation on GPU 
		                        Inje Kim, Jonghyun Jeong, Yunho Oh, Myung Kuk Yoon, and Gunjae Koo
		                        IEEE Access, vol. 10, pp. 113046-113060, Nov. 2022.
		                        
IEEE Access   
		                        GhostLeg: Selective Memory Coalescing for Secure GPU Architecture 
		                        Jongmin Lee, Seungho Jung, Taewon Suh, Yunho Oh,  Myung Kook Yoon, and Gunjae Koo
		                        IEEE Access, vol. 10, pp. 111449-111462, Nov. 2022.
		                        
IEEE Access 
		                        TEA-RC: Thread Context-Aware Register Cache for GPUs 
		                        Ipoom Jeong, Yunho Oh,  Won Woo Ro, and Myung Kuk Yoon
		                        IEEE Access, vol. 10, pp. 82049-82062, Aug. 2022.
		                        
ISCA 2021 
								Rebooting Virtual Memory with Midgard 
								Siddharth Gupta, Atri Bhattacharyya, Yunho Oh, Abhishek Bhattacharjee, Babak Falsafi, and Mathias Payer
								The 48th ACM/IEEE International Symposium on Computer Architecture, virtual conference, 2021.
								
MICRO 2020 
								Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores 
								Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, and William J. Song
								The 53rd IEEE/ACM International Symposium on Microarchitecture, virtual conference, 2020.
								
ISCA 2019 
								Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs 
								Yunho Oh, Gunjae Koo, Murali Annavaram, and Won Woo Ro
								The 46th ACM/IEEE International Symposium on Computer Architecture, Phoenix, AZ, USA, 2019.
								
IEEE TC 
								Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs 
								Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Murali Annavaram, and Won Woo Ro 
								IEEE Transactions on Computers, vol. 68, no. 4, pp. 609-616, Apr. 2019.
								
MICRO 2018 
								FineReg: Augmenting GPU Throughput via Fine-Grained Register File Management 
								Yunho Oh, Myung Kuk Yoon, William J. Song, and Won Woo Ro 
								The 51st ACM/IEEE International Symposium on Microarchitecture, Fukuoka, Japan, 2018.
								
IEEE TC 
								WASP: Selective Data Prefetching with Monitoring Runtime Warp Progress on GPUs 
								Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, and Won Woo Ro 
								IEEE Transactions on Computers, vol. 67, no. 9, pp. 1366-1373, Sept. 2018.
								
IEEE TPDS 
								Dynamic Resizing on Active Warps Scheduler to Hide Operation Stalls on GPUs 
								Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro 
								IEEE Transactions on Parallel and Distributed Systems, vol. 28, no. 11, pp. 3142-3156, Nov. 2017.
								
ISCA 2017 
								Access Pattern-Aware Cache Management for Improving Data Utilization in GPU 
								Gunjae Koo, Yunho Oh, Won Woo Ro, and Murali Annavaram 
								The 44th ACM/IEEE International Symposium on Computer Architecture, Toronto, ON, Canada, 2017.
								
ISCA 2016 
								APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs 
								Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, and Murali Annavaram 
								The 43rd ACM/IEEE International Symposium on Computer Architecture, Seoul, Korea, 2016.
								
ISPASS 2015 
								DRAW: Investigating Benefits of Adaptive Fetch Group Size on GPU 
								Myung Kuk Yoon, Yunho Oh, Sangpil Lee, Seung Hun Kim, Deokho Kim, and Won Woo Ro  
								The 2015 IEEE International Symposium on Performance Analysis of Systems and Software, Philadelphia, PA, USA, 2015.
								
ITC-CSCC 2015 
								Improving Pipeline Utilization with Two-Level Instruction Issue on GPUs 
								Yunho Oh, Jong Hyun Park, and Won Woo Ro 
								The 30th International Technical Conference on Circuits/Systems, Computers and Communications, Seoul, Korea, 2015.
								
KIISE 
								Introduction to Researches on Performance Bottlenecks of Many-Core GPU Architectures 
								Yunho Oh, Myung Kuk Yoon, Jong Hyun Park, and Won Woo Ro 
								Communications of KIISE, Vol. 32 No. 5, May, 2014.
								
IJPP 
								GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table 
								Yunho Oh, Doohwan Oh, and Won W. Ro 
								International Journal of Parallel Programming, Vol. 41, No. 4, pp. 526-551, August, 2013.
								
ICEIC 2010 
								Multi-Threaded Filtered BackProjection Algorithm on Multi-Core Processors 
								Yun H. Oh, and Won W. Ro 
								The 10th International Conference on Electronics, Information, and Communication, Cebu, Philippines, 2010.
								
 ISMRM 2010 
								Accelerated Reconstruction Using Parallel Computing for Spiral Spectroscopic Imaging 
								Dong H. Kim, Yun H. Oh, Yun H. Nam, M. Gu, and Won W. Ro 
								International Society for Magnetic Resonance in Medicine Annual Meeting, Stockholm, Sweden, 2010.
								
ELEX 
								Hardware Implementation of a Tessellation Accelerator for the OpenVG Standard 
								Seung Hun Kim, Yunho Oh, Karam Park, and Won W. Ro 
								IEICE Electronics Express, Vol. 7, No. 6, pp 440-446, March, 2010.		
Workshops
MLArchSys 
								Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating Point for DNN Training 
								Simla Burcu Harma, Ayan Chakraborty, Babak Falsafi, Martin Jaggi, and Yunho Oh 
 
								ML for Computer Architecture and Systems co-located with ISCA 2023.		
SPMA 
								AstriFlash: An Online Flash-Based Memory Hierarchy 
								Siddharth Gupta, Yunho Oh, Lei Yan, Mark Sutherland, Abhishek Bhattacharjee, Babak Falsafi, and Peter Hsu 
								The 10th Workshop on Systems for Post-Moore Architectures co-located with Eurosys 2020.		
HENND 
								Accelerating Neural Network with Selective Thread-Level Parallelism Regulation and Cache Bypassing on GPUs 
								Kwanghee Chang, Yunho Oh, Myung Kuk Yoon, and Won Woo Ro 
								International Workshop on Highly Efficient Neural Networks Design in conjunction with ESWEEK 2017.		
Patent
Operation device of convolutional neural network, operation method of convolutional neural network and computer program stored in a recording medium to execute the method thereof
William Jinho Song, Won Woo Ro, Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, and Bogil Kim
US Patent 20220391320A1.
Central processing unit, GPU simulation method thereof, and computing system including the same
Won Woo Ro, Karam Park, Yunho Oh, Sangpil Lee, and Minwoo Kim
US Patent 9,378,533.
Experience
Professional Experience
Sep. 2025 - Current  Associate Professor, Korea University   
								
Sep. 2022 - Aug. 2025  Assistant Professor, Korea University   
								
Feb. 2021 - Aug. 2022  Assistant Professor, SungKyunKwan University   
								
Mar. 2019 - Feb. 2021  Postdoctoral Researcher, École Polytechnique Fédérale de Lausanne   
								 Principal Investigator: Prof. Babak Falsafi
	 
								 - New memory system architecture for cost-efficient datacenters
			
								 - Microarchitecture-specific server workload behavior analysis
								 
Sep. 2018 - Jan. 2019.  Postdoctoral Researcher, Yonsei University   
								 - New register file architecture for GPUs with new memory technology 
	
								 - Energy-efficient neural network processing on GPUs 
	 
								 - Entropy analysis on GPU register compression 
								 
Mar. 2014 - Aug. 2018.  Graduate Research Assistant (Ph.D.), Yonsei University   
								 - Memory locality-aware warp scheduling technique on GPUs 
								 - Efficient prefetching techniques on GPUs 
								 - Efficient register file management on GPUs 
								 - Energy-efficient memory system for GPUs 
								 - Energy-efficient multi-core processor architecture 
								 
Aug. 2016 - Aug. 2017.    Visiting Graduate Scholar, University of Southern California, LA, CA   
								 Faculty Sponsor: Prof. Murali Annavaram 
								 - Efficient cache management technique on GPUs 
								 - In-SSD database index management system 
								 
Feb. 2011 - Jan. 2014.   Software Engineer, Mobile Communications Business, Samsung Electronics    
								 - Developed mobile applications for Android smartphones 
								 - Developed a cloud service framework for wearable devices 
								 
Mar. 2009 - Feb. 2011.   Graduate Research Assistant (M.S.), Yonsei University    
								 - Parallel genome sequence matching algorithm for GPUs 
								 - High-performance medical image construction algorithms 
								 - OpenVG accelerator prototype design 
								 
Teaching Experience
At Korea University  Sep. 2022 ~ Current. 
								 	System Software
								 	Computer Systems
								 	Microprocessors
								 	Digital Systems
								 	Object-Oriented Programming
								 	Engineering Mathematics I
								
At Sungkyunkwan University  Feb. 2021~ Aug. 2022. 
								 	Logic Design Laboratory
								 	Research Projects for Excellent Undergraduates
								 	Embedded System Designs
								
At EPFL  Mar. 2019~ Feb. 2021. 
	
									Topics on ML Systems (CS723)
									Topics on Datacenter Design (CS728)
									Semester Project 
									Introduction to Multiprocessor Architecture (CS307) 
								
At Yonsei University  
	
									GPGPU Programming Practice (Samsung Advanced Technology Training Institute)
									Computer Architecture (TA)
									Electrical and Electronic Engineering Experiments (TA) 
								
Talks
Feb. 2025.   Semiconductor Macro Industry Trends in AI Era: New Wave of Scaling and Customization 
								 Applied Materials 
								 
Oct. 2023.   Designing Scale-Out Systems in Post-Moore Era 
								 SK Hynix Design Summit 
								 
Oct. 2023.   Numerical Encoding and DNN Training in Post-Moore Era 
								 ICCE-Asia 
								 
Sep. 2023.   SK Semiconductor Education Program 
								 SK Group 
								 
Aug. 2022.   Improving Energy Efficiency of AI Acceleration Systems 
								 AISRC, Seoul National University 
								 
Jan. 2022.   Designing Scale-Out Data-Centric Systems: Virtual Memory and Accelerators 
								 Seoul National University (Virtual)  
								 
Aug. 2021.   Data-Centric Computing, Brick by Brick 
								 2021 SIGARCH Korea Workshop (Virtual)  
								 
Aug. 2021.   Towards Energy-Efficient DNN Acceleration Systems 
								 2021 Summer School, Korean Society of Artificial Intelligence in Medicine (Virtual)  
								 
June 2021.   Challenges in Next-Generation Datacenters: Accelerators and Memory Systems 
								 IT21 Conference (Virtual)  
								 
June 2020.   Designing Workload-Aware GPU Architectures  
								 HiSilicon (Remote talk)  
								 
Sep. 2019.   Resolving Critical GPU Performance Bottlenecks  
								 Hongik University, Seoul, Korea  
								 
Nov. 2018.   Rethinking GPU Execution Model  
								 EPFL, Lausanne, VD, Switzerland  
								 
Oct. 2018.   Towards Extreme Throughput  
								 EPFL, Lausanne, VD, Switzerland 
								 
Dec. 2018.    FineReg: Augmenting GPU Throughput via Fine-Grained Register File Management  
								 Invited Talk in Software Congress 2018, PyeongChang, Korea  
								 
Services
 Organization Committee
									IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2026
									ACM/IEEE International Symposium on Microarchitecture (MICRO), 2025
									Fall Conference of IEIE, 2024
									Fall Conference of IEIE, 2023
									5th Workshop on Future and Challenges of Memory-Centric Computing, 2023 
									ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
									IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2022
								
 Program Chair
									International Conference on Electronics, Information, and Communication (ICEIC), 2026 (TPC co-chair)
								
 Program Committee
									IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2026
									IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2026
									ACM/IEEE International Symposium on Microarchitecture (MICRO), 2025
									IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2025
									Conference on Systems and Machine Learning (MLSys), 2025
									IEEE International Symposium on Embedded Multicore/Manycore SoCs (MCSoC-2024) 
									Conference on Systems and Machine Learning (MLSys), 2024
									Workshop on General Purpose Processing using GPUs (GPGPU), 2024
									IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2024 (Light PC)
									Conference on Systems and Machine Learning (MLSys), 2023
									ACM Student Research Competition (SRC), 2022
									Conference on Systems and Machine Learning (MLSys), 2022
									Workshop on the Computing Continuum (in HiPEAC), 2020
								
 External Review Committee
									ACM/IEEE International Symposium on Microarchitecture (MICRO), 2024
									ACM/IEEE International Symposium on Microarchitecture (MICRO), 2022
									ACM/IEEE International Symposium on Microarchitecture (MICRO), 2021
									IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020
									ACM/IEEE International Symposium on Computer Architecture (ISCA), 2019
								
 Session Chair
									Session 2A: GPU and Data Analytics: ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2022
								
 Reviewer
							    	IEEE Transactions on Very Large Scale Integration Systems, 2025
							    	ACM Transactions on Storage, 2025
							    	IEEE Computer Architecture Letters, 2025
							    	IEEE Transactions on Sustainable Computing, 2025
							    	IEEE Transactions on Very Large Scale Integration Systems, 2024
							    	IEEE Transactions on Computers, 2024
							    	IEEE Transactions on Parallel and Distributed Systems, 2024
							    	ACM Transactions on Architecture and Code Optimization, 2024
							    	IEEE Computer Architecture Letters, 2024
							    	IEEE Computer Architecture Letters, 2023
							    	Parallel Computing, 2023
							    	ACM Transactions on Design Automation of Electronic Systems, 2022
							    	IEEE Computer Architecture Letters, 2022
							    	IEEE Transactions on Emerging Topics in Computing, 2021
									IEEE Computer Architecture Letters, 2021
									Future Generation Computer Systems, 2020
									IEEE Computer Architecture Letters, 2019
									IEEE Access, 2019
								
 Tutorial Chair
									International Conference on Electronics, Information, and Communication (ICEIC), 2022
								
Honors and Awards
Jun. 2025.    Excellent Lecture Award   
								 Korea University  
								 
Oct. 2024.    Excellent Lecture Award   
								 Korea University  
								 
Sep. 2023.    Excellent Lecture Award   
								 Korea University  
								 
Mar. 2023.    Excellent Lecture Award   
								 Korea University  
								 
Nov. 2017.    Excellent Graduate Researcher Scholarship  
								 Yonsei University  
								 
Aug. 2016 - Aug. 2017.   BK21 Long-Term Overseas Research Fellowship  
								 Yonsei University  
								 
Mar. 2009 - Feb. 2011.   Mobile Communication Graduate Fellowship  
								 Samsung Electronics  
								 
Feb. 2008.    Excellent Undergraduate Awarded Honors and Scholarship  
								 Yonsei University  
								 
Contact
E-mail   [firstname]_[lastname] at korea dot ac dot kr
LinkedIn   https://www.linkedin.com/in/yunho-oh-comarch/