social.kernel.org

Harry (Hyeonggon) Yoo

@hyeyoo

Posts

671

Following

107

Followers

127

A professional kernel hacker, born in August 6, 2000, and living in Korea (the South one!).

- Linux Kernel Developer @ Oracle (Linux Kernel MM) (2025.02 ~ Present)
- A slab subsystem co-maintainer and a reviewer for the reverse mapping subsystem
- Former Intern @ NVIDIA, SK Hynix, Panmnesia (Security, MM and CXL)
- B.Sc. in Computer Science & Engineering, Chungnam National University (Class of 2025)

Opinions are my own.

My interests are:
Memory Management,
Computer Architecture,
Circuit Design,
Virtualization

Harry (Hyeonggon) Yoo

hyeyoo

First day of PT, quite tired.
…now my trainer says only two sandwiches are allowed every day. Oh no!

0

0

2

Harry (Hyeonggon) Yoo

repeated

Jonathan Corbet

corbet

For a while now, the kernel's configuration and build systems have been an area of concern for me. Almost nobody truly understands those complex subsystems, which were handled by a single maintainer.

That maintainer, Masahiro Yamada, has just stepped down after eight years on the job:

https://web.git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=8d6841d5cb20

Happily, Nathan Chancellor and Nicolas Schier have agreed to pick up the build system. The configuration system, instead, is now unmaintained. That ... seems less than optimal.

Thanks to Masahiro for doing this work all these years, and to Nathan and Nicolas for stepping up!

3

20

51

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @ptesarik@infosec.exchange

@ptesarik Thanks for the answer!
Yeah, that’s the tricky part, deciding to ignore (or not) when statistics don’t tell us anything about it.

0

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @hyeyoo

Edited 9 months ago

By the way, if you're using mmtests for benchmarking, some results are marked significant (with asterisks) and some are not.

not significant: 17.74 ( -3.72%)
significant: 132.26 * -15.92%*

I assumed the asterisk would have some meaning, but didn't exactly know what it is until I read the code: "significant" means their (baseline and new) confidence intervals do not overlap.

When the variance is high and the number of observations is low, many results may not be marked as significant. However, that doesn’t necessarily mean the results can simply be ignored (?)

1

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Edited 9 months ago

Still digesting it, and may be slightly incorrect, but a summary of what I've learned: properly using statistics to compare the performance of computer systems [1] [2].

0. When you get a sample and calculate sample mean, it is likely different from the population mean. Simply comparing two sample means from different computer systems may therefore lead to misleading conclusions, since the observed difference could be due to random sampling variation rather than a real performance difference (especially when the variance is high).

1. Central Limit Theorem indicates that, regardless of the underlying distribution, the sampling distribution of the mean tends to follow normal distribution when the sample size is sufficiently large (typically n >= 30). But to apply CLT, the observations should be independent and collected from the same distribution.

2. Based on the CLT, we can estimate how close the sample mean (to be precise, any population parameter) is likely to be to the population mean. A confidence interval [x, y] with a confidence level of p% means that if we repeated the sampling process many times under the same conditions, about p% of those intervals would contain the population mean.

3. To compare two distributions, confidence intervals can help determine whether the difference between mean values is statistically significant. This can be done by:

- Checking whether the confidence intervals of the two samples do not overlap, or
- Examining the confidence interval for the mean of the differences to check if it does not include 0.

4. Caveat: The CLT assumes that the data is collected independently, meaning one observation does not affect another. In computer systems, this assumption usually does not hold. Caches, memory layout, scheduling decisions etc. can introduce some degree of dependencies between observations.

This can be mitigated by 1) reducing dependence between experiments as much as possible or 2) by applying the bootstrapping method [3].

[1] Jan Kara, Measuring performance regressions, https://youtu.be/HAHhW13ofrg?si=drgegMwXUDegHsQf
[2] Dev Jain, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling, https://www.amazon.com/Art-Computer-Systems-Performance-Analysis/dp/0471503363
[3] https://en.wikipedia.org/wiki/Bootstrapping_(statistics)

1

1

3

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @vbabka@mastodon.social

@vbabka @ljs @wagi

right, no matter how many transistors you throw into a cache.

0

0

3

Harry (Hyeonggon) Yoo

hyeyoo

Reply to

@wagi @ljs

I have a bigger, scalable bookshelf and I ordered small one to use it like a cache for a larger bookshelf but turns out the cache was too small :P

1

0

4

Harry (Hyeonggon) Yoo

hyeyoo

Reply to

@ljs it's so sad that the bookshelf isn't big enough to fit the best book!

0

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Got a new small bookshelf but the user experience isn’t great

0

0

5

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @conor

@conor @ljs @vbabka

Haha yeah, quite nice (watching my legs tremble)

1

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Reply to

@ljs @vbabka /me prepares to go out

0

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @vbabka@mastodon.social

@vbabka @ljs do you?

1

0

2

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @vbabka@mastodon.social

@vbabka nah 5am gym sounds like @ljs !

1

0

4

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @oleksandr@natalenko.name

Edited 9 months ago

@oleksandr didn’t refuse anything :’(
Perhaps should have refused a looong nap at night before 2am call

0

0

1

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @vbabka@mastodon.social

@vbabka just woke up from hibernation right now uh

1

0

1

Harry (Hyeonggon) Yoo

hyeyoo

Oh god let me fall asleep

2

0

1

Harry (Hyeonggon) Yoo

repeated

Omar Sandoval

osandov

@sj recently started using drgn to add more DAMON selftests, which is a super interesting use case that I didn't envision: https://lore.kernel.org/all/20250628160428.53115-1-sj@kernel.org/. It has already found a real bug! https://lore.kernel.org/all/20250719181932.72944-1-sj@kernel.org/

0

3

6

Harry (Hyeonggon) Yoo

hyeyoo

Reply to @oleksandr@natalenko.name

@oleksandr @vbabka

SLAIB?

size = ai_prompt("Size sufficient to fit five integers");
gfp = ai_prompt("Appropriate GFP flags for the current context?");
array = kmalloc(size, gfp);

1

1

1

Harry (Hyeonggon) Yoo

repeated

Oleksandr Natalenko, MSE

oleksandr@natalenko.name

linux-stable is a vibe-backported frankenkernel

0

2

1

Harry (Hyeonggon) Yoo

repeated

Linux Plumbers Conference

linuxplumbersconf@mastodon.social

All microconferences (MCs) at LPC 2025 have been accepted! It is time to submit topics to your favorite MCs.

Please check out our latest blog post for the list of MCs, and how to create a ideal MC topic.

https://lpc.events/blog/current/index.php/2025/07/25/all-microconferences-have-been-accepted/

0

9

4

About social.kernel.org

Terms of service

Please do not use this service in violation of the Linux Kernel Code of Conduct. Doing so will result in your account suspension with the referral of the matter to the CoC committee.
"Repeating"/"boosting" someone else's status on this platform will be treated as endorsement and will fall under rule #1.
You are encouraged to use this platform to promote your work on the Linux Kernel, but there is no restriction on permitted topics (with the exception of anything covered by #1 above).
There is no requirement to post in English, but it should be considered the primary language of communication on this platform.

Privacy notice

The admins of this service have access to all posted statuses. They aren't looking, but if it's something they shouldn't know about, then you should not post it on this platform.

Please see the Linux Foundation Privacy Policy, which applies to this platform as well.

Getting your own account

If you would like an account on this instance, please check that the following applies to you:

You are listed in MAINTAINERS or CREDITS
OR: You have a kernel.org account or email address
OR: You have a long and established history of involvement with the Linux Kernel

If the above is true and you agree with the Terms of Service and Privacy Notice listed above, please use these instructions to request an account:

How to request an account on social.kernel.org