Rational Java: JLBH Examples 1 - Why Code Should be Benchmarked in Context

In this post:

A side by side example using JMH and JLBH for Date serialisation
Measuring Date serialisation in a microbenchmark
Measuring Date serialisation as part of a proper application
How to add a probe to your JLBH benchmark
Understanding the importance of measuring code in context

In the last post, 'Introducing JLBH' we introduced JLBH the latency testing tool that Chronicle used to test Chronicle-FIX and is now available as open source.

In the next few posts we are going to look at some example applications:

All the code for the examples cane be found here in my GitHub project:

One of the points I made in the introduction to JLBH was that it is important to benchmark code in context. That means benchmarking code in an environment as close to how it will be running in real life as possible. This post demonstrates this in practice.

Let's look at a relatively expensive Java operation - Date Serialisation - and see how long it takes:

First here's a JMH benchmark:

Running on my laptop (MBP i7) these are the results I get:

Result "test":
4.578 ±(99.9%) 0.046 us/op [Average]
(min, avg, max) = (3.664, 4.578, 975.872), stdev = 6.320
CI (99.9%): [4.533, 4.624] (assumes normal distribution)
Samples, N = 206803
mean = 4.578 ±(99.9%) 0.046 us/op
min = 3.664 us/op
p( 0.0000) = 3.664 us/op
p(50.0000) = 4.096 us/op
p(90.0000) = 5.608 us/op
p(95.0000) = 5.776 us/op
p(99.0000) = 8.432 us/op
p(99.9000) = 24.742 us/op
p(99.9900) = 113.362 us/op
p(99.9990) = 847.245 us/op
p(99.9999) = 975.872 us/op
max = 975.872 us/op

# Run complete. Total time: 00:00:21

Benchmark Mode Cnt Score Error Units

DateSerialiseJMH.test sample 206803 4.578 ± 0.046 us/op

A mean time of 4.5us for the operation:

We get pretty much the same results running the test with JLBH:

These are the results:

Warm up complete (400000 iterations took 2.934s)
-------------------------------- BENCHMARK RESULTS (RUN 1) ---------
Run time: 10.0s
Correcting for co-ordinated:true
Target throughput:100000/s = 1 message every 10us
End to End: (1,000,000) 50/90 99/99.9 99.99/99.999 - worst was 4.2 / 5.8 352 / 672 803 / 901 - 934
OS Jitter (13,939) 50/90 99/99.9 99.99 - worst was 8.4 / 17 639 / 4,130 12,850 - 20,450
--------------------------------------------------------------------
-------------------------------- BENCHMARK RESULTS (RUN 2) ---------
Run time: 10.0s
Correcting for co-ordinated:true
Target throughput:100000/s = 1 message every 10us
End to End: (1,000,000) 50/90 99/99.9 99.99/99.999 - worst was 4.2 / 5.8 434 / 705 836 / 934 - 967
OS Jitter (11,016) 50/90 99/99.9 99.99 - worst was 8.4 / 17 606 / 770 868 - 1,340
--------------------------------------------------------------------
-------------------------------- BENCHMARK RESULTS (RUN 3) ---------
Run time: 10.0s
Correcting for co-ordinated:true
Target throughput:100000/s = 1 message every 10us
End to End: (1,000,000) 50/90 99/99.9 99.99/99.999 - worst was 4.2 / 5.8 434 / 737 901 / 999 - 1,030
OS Jitter (12,319) 50/90 99/99.9 99.99 - worst was 8.4 / 15 573 / 737 803 - 901
---------------------------------------------------------------------------------------------------- SUMMARY (end to end)---------------Percentile run1 run2 run3 % Variation
50: 4.22 4.22 4.22 0.00
90: 5.76 5.76 5.76 0.00
99: 352.26 434.18 434.18 0.00
99.9: 671.74 704.51 737.28 3.01
99.99: 802.82 835.58 901.12 4.97
worst: 901.12 933.89 999.42 4.47
--------------------------------------------------------------------

A mean time of 4.2us for the operation:

Note: This is case where there is no advantage using JLBH over JMH. I just include the code as a comparison.

Now we're going to run exactly the same operation but inside a TCP call the code will work like this:

Client send the server a fix message over TCP loopback (localhost)
Server reads the message
Server does the date serialisation
Server returns a message to the client

As explained in the previous post JLBH allows us to produce a latency profile for any part of the code. We will add a probe for stage 3.

This time the results look like this:

Warm up complete (50000 iterations took 3.83s)
-------------------------------- BENCHMARK RESULTS (RUN 1) ------------------------
Run time: 6.712s
Correcting for co-ordinated:true
Target throughput:20000/s = 1 message every 50us
End to End: (100,000) 50/90 99/99.9 99.99 - worst was 822,080 / 1,509,950 1,711,280 / 1,711,280 1,711,280 - 1,711,280
date serialisation (100,000) 50/90 99/99.9 99.99 - worst was 11 / 19 31 / 50 901 - 2,420
OS Jitter (64,973) 50/90 99/99.9 99.99 - worst was 8.1 / 16 40 / 1,540 4,850 - 18,350
--------------------------------------------------------------------
-------------------------------- BENCHMARK RESULTS (RUN 2) ---------
Run time: 6.373s
Correcting for co-ordinated:true
Target throughput:20000/s = 1 message every 50us
End to End: (100,000) 50/90 99/99.9 99.99 - worst was 1,107,300 / 1,375,730 1,375,730 / 1,375,730 1,375,730 - 1,375,730
date serialisation (100,000) 50/90 99/99.9 99.99 - worst was 11 / 19 29 / 52 901 - 1,670
OS Jitter (40,677) 50/90 99/99.9 99.99 - worst was 8.4 / 16 34 / 209 934 - 1,470
--------------------------------------------------------------------
-------------------------------- BENCHMARK RESULTS (RUN 3) ---------
Run time: 5.333s
Correcting for co-ordinated:true
Target throughput:20000/s = 1 message every 50us
End to End: (100,000) 50/90 99/99.9 99.99 - worst was 55,570 / 293,600 343,930 / 343,930 343,930 - 343,930
date serialisation (100,000) 50/90 99/99.9 99.99 - worst was 9.0 / 16 26 / 38 770 - 1,030
OS Jitter (32,042) 50/90 99/99.9 99.99 - worst was 9.0 / 13 22 / 58 737 - 934
--------------------------------------------------------------------
-------------------------------- SUMMARY (end to end)---------------
Percentile run1 run2 run3 % Variation
50: 822083.58 1107296.26 55574.53 92.66
90: 1509949.44 1375731.71 293601.28 71.07
99: 1711276.03 1375731.71 343932.93 66.67
99.9: 1711276.03 1375731.71 343932.93 66.67
99.99: 1711276.03 1375731.71 343932.93 66.67
worst: 1711276.03 1375731.71 343932.93 66.67
--------------------------------------------------------------------
-------------------------------- SUMMARY (date serialisation )------
Percentile run1 run2 run3 % Variation
50: 11.01 11.01 8.96 13.22
90: 18.94 18.94 15.62 12.44
99: 31.23 29.18 26.11 7.27
99.9: 50.18 52.22 37.89 20.14
99.99: 901.12 901.12 770.05 10.19
worst: 2424.83 1671.17 1032.19 29.21

--------------------------------------------------------------------

As can be seen the very same Date Serialisation take over twice as long from ~4.5us to ~10us.

It's not really the place here to go into too much detail about why the code takes longer to run when in context but it's to do with CPU caches getting filled in between calls to the date serialisation.

When all we are running (as in the micro benchmark) is the Date Serialisation then that can fit nicely into a CPU cache and never needs to get cleared out. However when there is a gap between calls to the Date serialisation the code for the operation gets cleared out and needs to be reloaded.

JLBH allows you to benchmark code in context and that's an important part of latency benchmarking.

15 comments:

Jean-Philippe Bempel19 April 2016 at 11:47
Hello,
I cannot agree more about running in context. I always considered microbenchmark as an idealization of the execution.

Reality is far from this. So any optimizations based on a Microbenchark, considering the order of magnitude between L1 cache and Dram access, can be completely ruin by a L3 miss.

Cheers
custom writing service reviews20 October 2017 at 17:36
JLBH does seem to work better. Thanks for recommending it and posting the example as well. Your post will definitely help everyone make the right decision.
Unknown2 December 2018 at 09:01
InstantPcApps
It's a hub for free mac, pc, android and ios software. You can get any kind of game, software and mobile application free of cost. 4K Video Downloader Crack
Unknown12 July 2019 at 13:26
Thanks for posting
Raju Mehandi Wala19 November 2019 at 09:17
Hello,
Thanks a lot dude,It means a lot of me, Its amazing, I have learned many things on your blog. If you dont mind kindly visit my blog...
Mehndi Artist in Delhi | Best Mehndi Artist in Delhi | Simple Mehndi Design | Mehndi Service at Home in Delhi
Delhi Courses Institutes13 January 2020 at 09:40
Extremely decent blog and articles. I am realy extremely glad to visit your blog. Presently I am discovered which I really need. Top 10 Digital Marketing Institute in Delhi | Top Digital Marketing Institute in Delhi | Marketing Courses in Delhi | Best Digital Marketing Training in Delhi
Nimrat Saxena24 January 2020 at 11:00
I really appreciate your work. Thanks for providing the information with this post... Physics Tuition in Rohini | Physics Classes in Rohini | Physics Institute in Rohini | Physics Classes in Rohini Delhi | Physics Tuition Center in Rohini
Top Coaching Centres Rohini27 January 2020 at 09:10
I like your blog post very much so thank you very much for posting and keep updating.

Web Designing Course in Rohini
Web Designing Training in Rohini
Web Designing Training in Rohini
Web Designing Course in Rohini
Web Designing Course in Rohini
patsm00re1826 June 2020 at 06:53
You realize the great math practice for your children? Prodigy Game is math practice used to be a battle — however not any longer.
mecolefqwarxp27 August 2020 at 09:20
3. Al Jefferson, Bobcats: Jefferson is already out with a badly sprained ankle. Charlotte is a tough place to play for any standout because there is very little help. Most of us breathe a little sigh of relief. Now it time to gather up the wrinkled, torn wrapping paper, frayed ribbons and bows, smashed gift boxes, and those pieces of infernal, impossible to open rigid plastic packaging (does anybody else hate trying to break into that stuff?) and to put it all into garbage bags or recycling bins for hauling away. The sooner, the better, I say.. (tags: Jordan Shoes For Sale Cheap, 2020 Jordan Release Dates, Ray Ban Sunglasses Outlet)

If people want marriage, however, they would have to find a private ceremony for that. I dare say many churches and synagogues would not offer such a ceremony to gays or to polygamists and as private institutions that would be their right. Government, on the other hand, should have no such power.. (tags: Cheap Real Yeezys, Discount Jordan Shoes Wholesale, Cheap Yeezys)

Hunt is replaced in the halves by Darren Nicholls. Leeson Ah Mau (rested following Denver Test) returns at starting prop for Vaughan. Blake Lawrie starts at prop for De Belin. Visit the Best Offers page on the website to check for cellar deals and savings. Wine lovers can score extra discounts by joining the Laithwaites Wine Club. Members receive cases of hand picked wines and can enjoy an introductory case of wine for 50% off, with two free glasses included in the offer (tags: Coach Outlet Store Online , Michael Kors Bags Sale, Coach Outlet Store, MK Outlet).
Quickbooks Expert27 January 2021 at 06:26
Nice & Informative Blog !
QuickBooks is an accounting software that effortlessly fulfils all the needs of businesses. In case you want an immediate help for QuickBooks error, call us on QuickBooks Phone Number 1-855-974-6537 to get your problem fixed instantly.
steve1 March 2021 at 06:34
Quickbooks Tool Hub is your one stop solution for any kinds of errors that occur in Quickbooks desktop app, Know more about this tool on how to download and install this tool.
alexa jone19 August 2021 at 06:43
Nice Information . if you are looking QuickBooks software solution then you can contact QuickBooks Customer Care Online. at

quickbooks phone number
Simmyjessure12 September 2021 at 05:27
very informative post keep sharing it you can also get most important information Regarding QuickBooks issue at

quickbooks customer service

Rational Java

Pages

Friday, 15 April 2016

JLBH Examples 1 - Why Code Should be Benchmarked in Context

15 comments: