Recently I've been tuning our market data feed processors. This is a single-threaded, "straight-line speed" application that reads data from the network, parses it, converts it to our internal data format, and writes it out again.
I've already reduced the time taken for a benchmark workload (parsing a large data feed file) by 68% using only code changes (I'll blog about this soon), but I wanted to see whether OS-level optimisations could reduce it further.
Here is an experiment in pinning the main application thread to a CPU core, to see whether that prevents the loss of performance that occurs when the OS scheduler moves the thread between CPU cores or sockets (losing cache contents in the process).
Results were measured using:
perf stat -o perf.log java MyWorkloadClass
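To smooth out run-to-run variation, perf stat can also repeat the workload and report averaged counters; the figures below are from single runs:
perf stat -r 5 -o perf.log java MyWorkloadClass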
On Linux you can use the taskset command to pin a process to a CPU core.
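If pinning the entire JVM is acceptable, you can launch it under taskset directly; a minimal sketch, assuming core 1 is the target:
taskset -c 1 java MyWorkloadClass
Bear in mind this confines every JVM thread (GC, JIT compiler, etc.) to that core, which is why pinning just the main thread takes a few more steps.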
First you need to discover the native ID for the Java thread you want to pin.
The steps for doing that are:
1) Get the Java process ID (this assumes you have 1 Java process)
pgrep java
2) Use the jstack tool to create a thread dump for the Java process ID.
jstack -l <pid>
The result will be something like:
"main" #1 prio=5 os_prio=0 tid=0x0000000002128800 nid=0x255b runnable [0x00007f3a00398000]
java.lang.Thread.State: RUNNABLE
...
3) Use grep on the result to pick out the thread you want to pin by name (e.g. main), keeping the line that contains nid (the native thread ID attribute).
grep main <stack dump> | grep nid
4) Use awk to extract the nid value (converting it from hex to decimal)
awk -F'nid=| runnable' '{printf "%d",$2}'
(Whether printf "%d" converts the 0x... hex string depends on your awk implementation; if it prints 0, use gawk's strtonum or shell arithmetic, as in the script below.)
This can be condensed into one Linux command:
jstack -l `pgrep java` | grep main | grep nid | awk -F'nid=| runnable' '{printf "%d",$2}'
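For repeated experiments it may be easier to wrap these steps in a small script. A minimal sketch, assuming a single Java process and a uniquely named thread (the script name and its arguments are hypothetical):

#!/bin/bash
# pin-thread.sh <thread-name> <core>, e.g. ./pin-thread.sh main 0
THREAD_NAME=${1:-main}
CORE=${2:-0}
PID=$(pgrep java)    # assumes exactly one Java process
# Pull the hex nid out of the thread dump line for the named thread
NID_HEX=$(jstack -l "$PID" | grep "\"$THREAD_NAME\"" | sed -n 's/.*nid=\(0x[0-9a-f]*\).*/\1/p')
TID=$((NID_HEX))     # shell arithmetic converts 0x... hex to decimal
taskset -cp "$CORE" "$TID"    # -c takes a CPU list rather than a bitmask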
The final step is to take this decimal native thread ID and pin it to a CPU core. The argument is a bitmask of allowed CPUs, so 0x00000001 pins the thread to the first core (CPU 0):
taskset -p 0x00000001 <java thread pid>
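Running taskset -p with just the thread ID (no mask) reports the current affinity, which is a quick way to confirm the pin took effect; output will look something like:
taskset -p <java thread pid>
pid 9563's current affinity mask: 1
(9563 is the decimal form of the nid=0x255b thread above. taskset -cp 0 <java thread pid> is the equivalent command using a CPU list instead of a mask.)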
Here are the results (perf output) for the sample workload with and without CPU pinning:
Pinning the Java thread reduced the workload time from 293 seconds to 225 seconds (a 23% reduction). The counters tell a consistent story: fewer context switches and CPU migrations, roughly half the stalled cycles, and a large drop in branch misses, all pointing to better cache and branch-predictor locality. The test was performed on a single-socket workstation, not a server:
cat /proc/cpuinfo | grep "model name"
Intel(R) Core(TM) i5 CPU         660  @ 3.33GHz
Intel(R) Core(TM) i5 CPU         660  @ 3.33GHz
Intel(R) Core(TM) i5 CPU         660  @ 3.33GHz
Intel(R) Core(TM) i5 CPU         660  @ 3.33GHz
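The i5 660 is a dual-core part with Hyper-Threading, so the four entries above are logical CPUs, with two sharing each physical core's caches. When choosing a core to pin to, lscpu -e (if available) shows which logical CPUs map to which core:
lscpu -e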
I will test on a server class machine next week.
Perf stats without CPU pinning
    297481.389002 task-clock                #    1.014 CPUs utilized
           35,674 context-switches          #    0.000 M/sec                  
            1,405 CPU-migrations            #    0.000 M/sec                  
          110,733 page-faults               #    0.000 M/sec                  
1,063,417,614,122 cycles                    #    3.575 GHz                     [83.34%]
  161,224,883,847 stalled-cycles-frontend   #   15.16% frontend cycles idle    [83.34%]
  144,636,718,849 stalled-cycles-backend    #   13.60% backend  cycles idle    [66.66%]
2,376,914,490,864 instructions              #    2.24  insns per cycle        
                                            #    0.07  stalled cycles per insn [83.33%]
  657,693,070,884 branches                  # 2210.871 M/sec                   [83.33%]
    8,735,293,464 branch-misses             #    1.33% of all branches         [83.34%]

    293.502171112 seconds time elapsed
Perf stats with CPU pinning
    229904.273198 task-clock                #    1.019 CPUs utilized          
           27,529 context-switches          #    0.000 M/sec                  
            1,219 CPU-migrations            #    0.000 M/sec                  
          144,662 page-faults               #    0.001 M/sec                  
  821,212,115,937 cycles                    #    3.572 GHz                     [83.33%]
   64,393,588,977 stalled-cycles-frontend   #    7.84% frontend cycles idle    [83.33%]
   48,551,948,602 stalled-cycles-backend    #    5.91% backend  cycles idle    [66.66%]
2,209,615,996,688 instructions              #    2.69  insns per cycle        
                                            #    0.03  stalled cycles per insn [83.34%]
  623,314,562,813 branches                  # 2711.192 M/sec                   [83.33%]
      366,547,412 branch-misses             #    0.06% of all branches         [83.35%]

    225.562024911 seconds time elapsed