Do Microkernels Suck?
UNSW, NICTA and Open Kernel Labs
Gernot Heiser
OLS 2007
Talk by Christoph Lameter: "Extreme High Performance Computing or Why Microkernels Suck"
Contents:
- This is how we got Linux to scale to 1000s of CPUs
  - clearly knows what he's talking about, no need to add to this...
- This is why microkernels can't do the same
  - clearly hasn't got a clue about microkernels
  - I'll explain...
Summary of Paper
- Look, we've scaled Linux to 1000 processors [with a little help of billions of $$ from IBM, HP, SGI, ...], microkernels [developed mostly by cash-strapped universities] haven't done the same, obviously they suck
- Equivalent statement in 1998: look, Windows has drivers for zillions of devices, Linux doesn't, hence Linux sux
- Very scientific approach, right?
- OK, I'm exaggerating somewhat, but let's see what it really says...
Common Misconceptions
- Claim: microkernel-based systems are less reliable, as failure of one component makes the whole system fail
- Wrong!
  - Counterexample: QNX High Availability Toolkit (sold commercially since 2001)
  - More recent counterexample: Minix 3, which is open source, check it out for yourself
- Where reliability matters most, microkernels are used: aerospace, automotive, medical devices...
A Voice from the Coal Face
- NTFS-3G is a user/hybrid-space driver
- Similar functionality and performance on commodity hardware as in-kernel file systems
- The invested effort and resources were only a fraction of what is usually needed, besides other benefits
- The empirical learnings keep being highly instructive, refuting widely believed folklore

Szaka Szabolcs, leader of NTFS-3G, http://ntfs-3g.org
Common Misconceptions
- Claim: microkernels rely on IPC, IPC requires expensive message-queue operations, hence IPC is costly
- Wrong! (L4 IPC is synchronous: short messages are passed in registers, no message queues involved)
- Counterexample: L4, since 1993 (published at SOSP)
  - L4 runs on 10s of millions of mobile phones, where OS performance is critical for cell-phone baseband processing
  - L4 expected to run on 250M mobile devices within a year
- Why the sudden popularity?
  - it's fast
  - it's small
  - it enables fault containment
Let's Look at IPC
- IPC is used to obtain system services
- ⇒ IPC performance is important
Intrinsic Difference Syscall vs IPC
- Syscall: 2 mode switches (user → kernel, kernel → user)
- IPC: 2 mode switches + 1 context switch
- Server invocation needs 2 IPCs
  - extra cost is 2 mode switches, 2 context switches
  - this is the inherent microkernel overhead!
  - it is wrong to think that IPC is used all over the system (replacing function calls)
- Is it significant? (rough numbers below)
  - depends on the ratio between overhead and total cost of the service obtained
  - it's a killer for the null system call
  - it's irrelevant for most others
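To make that ratio concrete (an illustrative back-of-the-envelope using figures from the tables that follow, not numbers from Lameter's paper): inter-address-space IPC on ARM XScale costs about 180 cycles, so a server invocation adds roughly 360 cycles, i.e. about 0.9 microseconds at 400MHz. That roughly doubles a null system call (0.8 microseconds native in the lmbench table below), but vanishes against a fork (~5700 microseconds) or an exec (~17000 microseconds).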
Actual L4 IPC Cost [cycles]
Architecture    Intra address space    Inter address space
Pentium                113                    305
AMD-64                 125                    230
Itanium                 36                     36
MIPS64                 109                    109
ARM XScale             170                    180
How do a couple hundred cycles compare to the typical Linux system call???
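You can get a rough answer on your own machine with a few lines of C (a minimal sketch, not from the talk: it assumes Linux, uses getpid() invoked through syscall(2) as the cheapest call, and the CPU_MHZ constant is an assumed clock rate you must set yourself):

    /* Time a tight loop of null-ish system calls and convert the
     * per-call latency into cycles, for comparison with the IPC
     * table above.  Sketch only: no error handling. */
    #include <stdio.h>
    #include <time.h>
    #include <unistd.h>
    #include <sys/syscall.h>

    #define ITERATIONS 1000000L
    #define CPU_MHZ    400.0   /* assumed clock, e.g. a 400MHz PXA255 */

    int main(void)
    {
        struct timespec start, end;

        clock_gettime(CLOCK_MONOTONIC, &start);
        for (long i = 0; i < ITERATIONS; i++)
            syscall(SYS_getpid);   /* forces a real kernel entry each time */
        clock_gettime(CLOCK_MONOTONIC, &end);

        double ns = (end.tv_sec - start.tv_sec) * 1e9
                  + (end.tv_nsec - start.tv_nsec);
        printf("%.0f ns per call, ~%.0f cycles at %.0f MHz\n",
               ns / ITERATIONS, ns / ITERATIONS * CPU_MHZ / 1e3, CPU_MHZ);
        return 0;
    }

Whatever your machine prints is the baseline to hold the table above against.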
Sort-of Extreme Example: Linux on L4
- Cops the full microkernel overhead
- Doesn't get any of the microkernel benefits
- How does it perform?
Linux on L4: ReAIM Macrobenchmark
ReAIM Benchmark    1 Task    2 Tasks    3 Tasks
Native              45.2      23.6       15.8
Virtualised         43.6      22.6       15.3
Ratio               0.96      0.96       0.97
Native Linux vs Linux virtualised on L4, on XScale PXA255 @ 400MHz. Not everything in L4 fully optimised yet (fork/exec).
Lmbench microbenchmarks
Benchmark                    Native    Virtualised    Ratio
lmbench latencies in microseconds, smaller is better
lat_proc procedure             0.21       0.21         0.99
lat_proc fork                  5679       8222         0.69
lat_proc exec                 17400      26000         0.67
lat_proc shell                45600      68800         0.66
lmbench bandwidths in MB/s, larger is better
bw_file_rd 1024 io_only        38.8       26.5         0.68
bw_mmap_rd 1024 mmap_only     106.7      106           0.99
bw_mem 1024 rd                416        412.4         0.99
bw_mem 1024 wr                192.6      191.9         1.00
bw_mem 1024 rdwr              218        216.5         0.99
bw_pipe                         7.55      20.64        2.73
bw_unix                        17.5       11.6         0.66
Native Linux vs Linux virtualised on L4, on XScale PXA255 @ 400MHz. Not everything in L4 fully optimised yet (fork/exec).
Lmbench Context Switching
Benchmark           Native    Virtualised    Ratio
lmbench latencies in microseconds, smaller is better
lat_ctx -s 0 1         11         20          0.55
lat_ctx -s 0 2        262          5         52.4
lat_ctx -s 0 10       298         45          6.62
lat_ctx -s 4 1         48         58          0.83
lat_ctx -s 4 10       419        203          2.06
lat_fifo              509         49         10.39
lat_pipe              509         49         10.39
lat_unix             1015         77         13.18
lat_syscall null        0.8        4.8        0.17
Native Linux vs Linux virtualised on L4, on XScale PXA255 @ 400MHz
How Can Virtual be Faster than Real?
- It's a microkernel!
  - the complete kernel is about 10-11 kloc
  - Linux is big: 100s of kloc, not counting drivers, file systems etc.
- The ARM MMU is quirky and needs a lot of effort to optimise
  - much easier to optimise a small code base
- Of course, the same can be achieved with Linux
  - in fact, we did it and offered patches upstream
  - maintainers didn't take them (who cares about a factor of 50!)
  - SnapGear is running our patches in their modems
Back to Multiprocessor Scalability
- Lameter myth: IPC is needed across nodes inside a microkernel OS, and on NUMA this causes problems allocating the message queues in a NUMA-friendly way
- Whom you gonna call: the local or the remote OS? (a sensibly structured system talks to the local one)
Multiprocessor Scalability
[Graph: system call slowdown vs number of CPUs, compared across several commercial systems]
- only one system scales (constant slowdown): which is it?
What's the story?
- The Tornado microkernel scales perfectly to 16 processors, and this is 1999! [Gamsa et al, 3rd OSDI]
  - done by a small group at the Univ of Toronto
  - Tornado is the predecessor of IBM's K42
- How far did Linux scale in 1999?
- How far would Linux scale today on the same benchmarks?
- Note: the benchmarks perform concurrent ops on all CPUs: page faults, fstats, thread creation
Synchronization Claims
- Claim: microkernel isolation limits synchronisation methods, and data structures have to be particular to subsystems
- "Linux would never have been able to scale to these extremes with a microkernel approach because of the rigid constraints that strict microkernel designs place on the architecture of operating systems"
- This is simply wrong (repeating it doesn't make it right)
  - synchronisation in a well-designed system is local to subsystems anyway
  - there is no reason why subsystems can't share memory, even if microkernel-based (see the sketch below)
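To illustrate that last point at user level (a sketch only: this uses POSIX shared memory rather than any microkernel API, and the object name /mk_demo and the shared counter are invented for the example), two "subsystems" in separate address spaces can still synchronise on one shared data structure:

    /* Two processes ("subsystems" in separate address spaces) share a
     * counter protected by a process-shared mutex in a shared mapping. */
    #include <fcntl.h>
    #include <pthread.h>
    #include <stdio.h>
    #include <sys/mman.h>
    #include <sys/wait.h>
    #include <unistd.h>

    struct shared_state {
        pthread_mutex_t lock;
        long counter;
    };

    int main(void)
    {
        /* Create a shared-memory object and map it; the fork() below
         * puts the same mapping into a second address space. */
        int fd = shm_open("/mk_demo", O_CREAT | O_RDWR, 0600);
        ftruncate(fd, sizeof(struct shared_state));
        struct shared_state *s = mmap(NULL, sizeof *s,
                                      PROT_READ | PROT_WRITE,
                                      MAP_SHARED, fd, 0);

        /* The mutex must be marked process-shared to work across
         * address spaces. */
        pthread_mutexattr_t attr;
        pthread_mutexattr_init(&attr);
        pthread_mutexattr_setpshared(&attr, PTHREAD_PROCESS_SHARED);
        pthread_mutex_init(&s->lock, &attr);

        if (fork() == 0) {                   /* subsystem B */
            for (int i = 0; i < 100000; i++) {
                pthread_mutex_lock(&s->lock);
                s->counter++;
                pthread_mutex_unlock(&s->lock);
            }
            _exit(0);
        }
        for (int i = 0; i < 100000; i++) {   /* subsystem A */
            pthread_mutex_lock(&s->lock);
            s->counter++;
            pthread_mutex_unlock(&s->lock);
        }
        wait(NULL);
        printf("counter = %ld\n", s->counter);  /* 200000, no updates lost */
        shm_unlink("/mk_demo");
        return 0;
    }

Compile with -lpthread (plus -lrt for shm_open on older glibc). The point is only that address-space isolation forbids nothing here: the sharing and the locking discipline are the subsystems' own choice.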
OS Scalability Principles
- The OS must not impose synchronisation overhead, except as forced by user code
- Then: user code scalable ⇒ system scalable
- What does this mean?
  - keep data structures local
  - process system calls on the caller's CPU
  - only involve other CPUs if the caller explicitly asks for it:
    - creating/killing/signalling a thread on another CPU
    - invoking a synchronisation system call
    - unmapping pages
- If this is done, you get a scalable OS even if the apps actually perform system calls
  - user pays for what user asks for... (see the sketch below)
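A toy user-level illustration of "keep data structures local" (a sketch, not from the talk: NCPUS, the event counter and the padding constant are invented for the example):

    /* Per-CPU counters, one cache line each, so the common-case
     * update never touches memory another CPU is writing. */
    #include <stdio.h>

    #define NCPUS      16
    #define CACHE_LINE 64

    struct percpu_counter {
        unsigned long count;
        char pad[CACHE_LINE - sizeof(unsigned long)];  /* no false sharing */
    };

    static struct percpu_counter events[NCPUS];

    /* Fast path: runs on the caller's CPU and touches only the
     * caller's cache line; no lock, no cross-CPU traffic. */
    static void count_event(int cpu)
    {
        events[cpu].count++;
    }

    /* Slow path: only a caller who explicitly asks for the global
     * view pays for touching every CPU's line. */
    static unsigned long total_events(void)
    {
        unsigned long sum = 0;
        for (int cpu = 0; cpu < NCPUS; cpu++)
            sum += events[cpu].count;
        return sum;
    }

    int main(void)
    {
        count_event(0);
        count_event(3);
        printf("total: %lu\n", total_events());
        return 0;
    }

Adding CPUs adds no synchronisation cost on the fast path; the cross-CPU cost appears exactly when, and only when, a user asks for it.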
Summary
- "Hey, I can do this cool thing but you can't"
  - how do you know, if you don't understand me?
- Linux is cool, but this doesn't mean it is perfect for everything
  - nor does it mean Linux will remain as it is forever
- Same is true for microkernels