Advanced 6 min · May 23, 2026

Production Debugging in Spring Boot: Fix It Before It Fixes You

Q: How do I get a thread dump from a Kubernetes pod running Spring Boot?

Run `kubectl exec -- jstack 1 > thread_dump.txt`. The Java process is usually PID 1 inside a container. If jstack isn't available, use `kubectl exec` to a sidecar container with the JDK, or enable Actuator's `/actuator/threaddump` endpoint and curl it.

Q: What's the first thing to check when HikariCP says 'Connection is not available'?

Check the database for active queries: `SELECT * FROM pg_stat_activity WHERE state = 'active'`. If you see long-running queries, find their origin. Check for deadlocks with `SELECT * FROM pg_locks WHERE NOT granted`. Then check your `@Transactional` scopes — are any methods making slow HTTP calls inside a transaction?

Q: Should I use `@EnableTransactionManagement` in Spring Boot?

No, it's automatically enabled in Spring Boot when you have `spring-tx` on the classpath. Adding it manually can cause issues if you set `mode = AspectJ` incorrectly. Leave it to auto-configuration.

Q: What is the difference between Hibernate's `org.hibernate.stat.Statistics` and `spring.jpa.properties.hibernate.generate_statistics`?

The property `hibernate.generate_statistics=true` enables Hibernate's internal statistics collection. You can then access them via the `Statistics` MBean or in logs. It logs every SQL statement, its execution time, and the number of rows fetched. It's the easiest way to find N+1 queries in production.

Q: Why did my Spring Boot app crash with 'Unable to create native thread' even though heap usage was low?

This is an OS-level thread limit, not a JVM heap issue. The JVM creates one thread per HTTP request (Tomcat default). If you have 1000 concurrent requests and the OS thread limit is 1000 (common in containers), you hit it. Check `/proc/sys/kernel/threads-max` and your container's resource limits. Fix: increase container CPU/memory resources or reduce Tomcat's `server.tomcat.threads.max`.

Real production Spring Boot debugging scenarios.

Naren Founder & Principal Engineer

20+ years shipping production Java in banking & fintech. Everything here is grounded in real deployments.

✓ Production

production tested

July 04, 2026

last updated

1,697

articles · all by Naren

Before you start⏱ 30 min

✓Deep production experience
✓Understanding of internals and trade-offs
✓Experience debugging complex systems

● Production Incident 🔎 Debug Guide ⚙ Triage Commands

⚡Quick Answer

Connection pool exhaustion looks like slow everything, not OOM.
Thread dumps + heap dumps are your primary tools. Don't guess.
jstack, jmap, and heap.hprof are the holy trinity.
Slow queries hide behind Hibernate stats. Enable them before the incident.
Rollback a deployment first — it's the fastest diagnostic step you have.

✦ Definition~90s read

What is Spring Boot Real-world Debugging Scenarios?

Production debugging in Spring Boot is the skill of diagnosing and fixing failures under live traffic, without bringing the system down. It's not unit testing. It's not running in your IDE. It's connecting to a running JVM, dumping thread stacks, reading heap dumps, and parsing log levels on the fly.

★

Think of your Spring Boot app as a busy restaurant kitchen.

Real debugging means understanding that OutOfMemoryError: Java heap space is rarely about heap space — it's about a leak, a config mistake, or a runaway batch process. It means knowing that a HikariPool-1 - Connection is not available error often hides behind a deadlocked transaction or a slow query that never returned.

It means having the reflexes to toggle logging.level.org.springframework.transaction.interceptor=TRACE without restarting the pod.

This guide gives you the battle-tested patterns. No theory. No fluff. Just the commands and mental models that prevent 3AM pages.

Plain-English First

Think of your Spring Boot app as a busy restaurant kitchen. If the stove breaks (database connection), every order slows down, but the kitchen still looks busy. You don't fix it by yelling at the waiters — you check the gas line. Debugging in production means finding the real broken pipe, not the symptom.

You just got paged at 2:17 AM. Your service is serving HTTP 503s. The health check is failing, but the CPU is idle and memory usage looks normal. Your first instinct? Restart the pod. Don't. That destroys the evidence.

That crucible moment separates junior ops from senior engineers. The ones who restart are hoping the problem goes away. The ones who first snapshot the JVM state — thread dump, heap dump, JMX metrics — are hunting. I've seen teams burn 8 hours on a mystery that was a single, misconfigured connection pool timeout.

In my second year running a payment processing service, we lost $40k in failed transactions because nobody knew that spring.datasource.hikari.connection-timeout defaults to 30 seconds. Thirty seconds of stalled requests cascading through a cluster. That's the kind of incident that teaches you to read configuration the way a diver reads an oxygen tank gauge.

This isn't a tutorial. It's a field manual. Every pattern here comes from a real outage I personally debugged or cleaned up after. I'm going to show you the exact commands, the exact tools, and the exact thought process. You'll learn why HikariPool-1 - Connection is not available, request timed out after 30000ms is almost never your database's fault. And you'll learn how to prove that in under five minutes.

The Connection Pool is a Liar: How HikariCP Hides Real Problems

HikariCP is the best connection pool in the Java world. It's fast, lightweight, and rarely the source of a bug. But it's the first thing people blame when requests slow down. Stop blaming the pool. The pool is just a mirror. It reflects your database's behavior.

I witnessed a post-mortem where a team doubled maximum-pool-size from 10 to 50, then 50 to 200. The system collapsed faster. Why? They had a single query running full table scans that held locks for 30+ seconds. More connections meant more concurrent slow queries, which meant more contention, which meant more timeouts. The pool wasn't the problem — it was the canary.

Here's the pattern: Threads wait on the pool → pool is exhausted → you think 'need more connections'. Wrong. The real question is: why are existing connections not coming back? Three root causes account for 90% of cases: 1. A @Transactional service method that's too broad, holding a connection while doing I/O (HTTP call, file read). 2. A raw JDBC Connection that was opened but never closed due to an exception skirting the finally block. 3. A database deadlock or long-running query that the pool's connectionTimeout can't outrun.

Fix: Before touching the pool size, find the slow queries. Enable Hibernate stats, read the logs, and find the N+1s or missing indexes. Then, if you still have issues, look at spring.datasource.hikari.maxLifetime — if it's set lower than your DB's wait_timeout, you'll get ghost connections.

Production Trap:

Setting spring.datasource.hikari.connection-timeout to a value higher than your database's wait_timeout or innodb_lock_wait_timeout means the pool will never cleanly time out. You'll get 'Connection is not available' after 30s instead of a fast 5s failure. Always keep connection-timeout ≤ DB lock timeout.

Production Insight

A connection pool exhausted at 20 connections is a crisis. Exhausted at 200 is the same crisis, just more expensive.

Key Takeaway

Before increasing pool size, find the query or transaction holding connections too long. The pool size is a symptom, not the cure.

thecodeforge.io

Spring Boot Scenario Debugging

Thread Dumps Are Your X-Ray: How to Read Them Under Fire

When the app is slow but not dead, and you have no clear error, a thread dump is your only friend. It shows you exactly what every thread is doing at one instant. It's a snapshot of your entire application's state.

I had a situation where a Kafka consumer kept failing to commit offsets. The logs showed 'commit cannot be completed since the group has already rebalanced'. The team spent 2 days tuning max.poll.interval.ms and session.timeout.ms. Nothing worked. I took a thread dump. Found 12 threads in BLOCKED state fighting over a shared HashMap in a singleton bean. The consumers were processing messages slowly because they were waiting on each other to finish writing to the map. The rebalance was a cascade effect.

Tools: jstack <pid> > dump.txt is your first move. On Kubernetes, use kubectl exec <pod> -- jstack 1 > dump.txt (PID 1 is usually the Java process in a container). Then grep for BLOCKED, WAITING, and TIMED_WAITING. Look for threads stuck on org.apache.tomcat.util.threads.TaskQueue.offer or com.zaxxer.hikari.pool.HikariPool.getConnection. Those are your smoking guns.

Pro tip: Take 3 thread dumps, each 10 seconds apart. A single dump can be misleading — a thread might be temporarily waiting on GC. Three dumps show you what's persistent. Compare them. If the same thread is waiting in the same method in all three, you found the culprit.

Senior Shortcut:

In Kubernetes, you don't need to exec into the pod. Use kubectl logs <pod> --previous to get the last log before a crash, but for thread dumps, exec is fine. Or better: expose a Spring Actuator endpoint that dumps threads under security. Then curl it.

Production Insight

A thread dump taken under load is worth 10,000 lines of logs.

Key Takeaway

Learn jstack fluency. Recognize BLOCKED on a synchronized block vs WAITING on a LockSupport.park(). They mean different things.

Heap Dumps: The Art of Finding the 1MB Leak in 25GB

Heap dumps scare most engineers. They're huge, slow to analyze, and the tooling is arcane. But they're the only way to find the memory leak that's been slowly killing your service for days.

I once analyzed a 12GB heap dump from a production instance that was crashing every 31 hours. The dominant class was byte[]. I sorted by retained heap. The top consumer was a HashMap in a singleton bean holding byte[] keys. Turned out a developer had used a ConcurrentHashMap as a cache, but the keys were generated from UUID.randomUUID() — every request created a new key, and nothing ever evicted them. 12GB of orphaned byte arrays.


Start with jmap -dump:live,format=b,file=/tmp/heap.hprof <pid>. The live flag triggers a full GC first, so you only dump live objects. That's critical — it removes the noise of garbage. Then open with Eclipse MAT (or JProfiler if you're on a Mac). Use the 'Leak Suspects' report. It finds the single biggest retained object set. 9 times out of 10, that's your leak.
Common patterns: ThreadLocal variables that accumulate data per request thread (e.g., a MDC context not cleared), HttpClient response bodies that weren't closed, and @Cacheable caches without @CacheEvict or TTL configuration.
Interview Gold:
In an interview, if they ask about memory leaks, say 'The most common cause in Spring Boot is holding references in ThreadLocal or @SessionScope beans without cleaning them up.' Then mention -XX:+HeapDumpOnOutOfMemoryError as the must-have JVM arg.
Production Insight
A heap dump from production is a crime scene. Don't alter the body (restart) before the coroner (MAT) arrives.
Key Takeaway
Don't guess the leak. Let Eclipse MAT's 'Leak Suspects' report tell you. It's rarely where you think.



  thecodeforge.io
  
Spring Boot Scenario Debugging
Actuator is Your Emergency Dashboard: Use It Before It's an Emergency
Never Do This:
Don't expose Actuator endpoints to the public internet without authentication. I've seen a startup using management.endpoints.web.exposure.include=* with no auth. Attackers could change log levels to enable remote code execution via expression injection in some configurations.
Production Insight
If you haven't configured Actuator security on day one, you're building on sand.
Key Takeaway
Enable Actuator from the first commit. Secure it from the second. You'll need thread dumps and log-level changes during an incident, and you want them ready.
Transactional Pitfalls: The @Transactional That Breaks Everything
@Transactional seems simple. It's not. The most common bug I see is a service method annotated @Transactional that internally makes a REST call to another microservice. The HTTP call takes 2 seconds. The database connection stays open for those 2 seconds. That's two seconds of holding a connection that could serve another request.
In a high-traffic system with 10 pool connections and 100 requests/second, only 10 requests can be in the transaction at any time. The rest are queued at the pool. Response times spike. The pool exhausts. Every request fails.
Self-invocation is another classic. If you have a @Transactional method in a Spring bean, and you call it from another method in the same class, the annotation is ignored. Spring's AOP proxies only intercept external calls. I've debugged a production issue where a save() that should have been transactional was not, causing partial writes and data corruption. The fix: either call the method from another class, or inject the proxy (awful pattern, but it works: ((YourService) AopContext.currentProxy()).foo()).
Isolation levels matter too. @Transactional(isolation = Isolation.REPEATABLE_READ) on Postgres can cause serialization failures under high contention could not serialize access due to read/write dependencies. If you don't handle CannotSerializeTransactionException properly, the entire request fails.
Senior Shortcut:
Use @Transactional(propagation = Propagation.REQUIRES_NEW) for long-running sub-operations that should not hold the outer transaction. Or better: extract the DB work into a separate service class. Self-invocation bugs vanish when you inject the other service.
Production Insight
If your transaction holds a connection while it calls someone else's API, you're designing a failure cascade.
Key Takeaway
Transactions should be as narrow as possible. Never mix I/O with DB work in the same transactional boundary.
Configuration That Saves Your Weekend: loggers, delays, and flags
I have a standard set of configurations I add to every Spring Boot project before it goes to production. They've saved me more than once.
spring.datasource.hikari.leak-detection-threshold=30000 — This logs a stack trace when a connection is held longer than 30 seconds. It's the fastest way to find a connection leak. Don't run it in high-traffic dev for long, but turn it on immediately during an incident.
logging.level.org.springframework.transaction.interceptor=TRACE — Logs every transaction begin and commit. It's verbose but invaluable when debugging why a transaction is taking too long.
spring.jpa.properties.hibernate.generate_statistics=true — Logs every query, its timing, and number of rows fetched. You'll see N+1 queries printed as '100 JDBC statements executed for 100 entities'.
server.tomcat.threads.max=200 combined with server.tomcat.accept-count=100 — Controls thread pool size. If your app is I/O heavy (many external calls), increase max threads. If CPU-bound, decrease it. Tune this before you need to.
And the one that's saved me more times than any other: management.endpoint.health.probes.enabled=true. This separates Kubernetes liveness/readiness probes from the health endpoint. You can be 'live' but not 'ready' — for example, when the database is down, you want Kubernetes to stop routing traffic to you (readiness fails) but not kill the pod (liveness passes). This prevents the restart loop during a DB outage.
The Classic Bug:
I once saw a team set leak-detection-threshold=10000 (10 seconds) and forget it in production. It logged a stack trace for every query that took more than 10 seconds. The log volume caused a 200GB disk fill in 4 hours. Use this flag sparingly in production — 30 seconds is safe, 10 seconds is risky.
Production Insight
The difference between a 30-minute debug and a 3-hour one is often a single configuration flag you turned on before the incident.
Key Takeaway
Proactive configuration is better than reactive debugging. Add leak-detection-threshold, generate_statistics, and probes.enabled=true to every production app.
jstack Is Not a Luxury: Why You Need Thread Dumps Before the Fire
Thread dumps are not an autopsy tool. They are a diagnostic scalpel. By the time your production app is crawling, you've already lost context. Jstack gives you a snapshot of every thread's stack trace at that exact moment. You can capture them without restarting, without downtime, and without any fancy tooling. Just kill -3 <pid> and the JVM dumps them to stdout. Or use jstack directly. The real question is: can you read them? Look for threads in BLOCKED or WAITING state that hold locks. A thread that's been in RUNNABLE for ten seconds is your smoking gun. Learn the pattern: one thread holding a lock, dozens waiting. That's a deadlock or a slow database query. Thread dumps are cheap to capture. Run them every 30 seconds during an incident and diff the output. The difference tells the story.
ThreadDumpCapture.javaJAVA
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
// io.thecodeforge — java tutorial
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

public class ThreadDumpCapture {
    public static void main(String[] args) {
        ThreadMXBean mxBean = ManagementFactory.getThreadMXBean();
        ThreadInfo[] threads = mxBean.dumpAllThreads(true, true);
        for (ThreadInfo ti : threads) {
            System.out.println("Thread: " + ti.getThreadName() +
                " State: " + ti.getThreadState() +
                " Lock: " + ti.getLockName() +
                " Waited: " + ti.getWaitedCount());
        }
    }
}
Output
Thread: http-nio-8080-exec-1 State: BLOCKED Lock: 0x00000007c2e3f8a0 Waited: 15
Production Trap:
Thread dumps rotate quickly in production logs. Pipe them to a file immediately. 'kill -3' goes to stderr, not stdout. Make sure your logging framework captures stderr with a timestamp.
Key Takeaway
Thread dumps are free, instant, and non-invasive. Always capture a sequence during an incident, never a single snapshot.
The /heapdump Endpoint: Your Get-Out-of-Jail-Free Card
Don't wait for an OutOfMemoryError. By then the GC has already purged the evidence. Actuator's /heapdump endpoint is a live snapshot you can trigger on demand. One curl command, and you get a .hprof file. The trick is knowing when to take it. Right before a full GC? Useless. Right after a memory spike that hasn't triggered GC? Gold. You need a baseline. Take a heap dump on a healthy instance when everything is running fine. Then compare it to the sick one. Look for unexpected byte arrays, strings, or thread-local caches. A common killer: unbounded caches with no eviction policy. If you see a HashMap with 10 million entries, you found your leak. Tools like Eclipse MAT or IntelliJ's profiler will show you the dominator tree. Find the object that holds the most retained heap, then trace it back to the code that created it.
HeapDumpTrigger.javaJAVA
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
// io.thecodeforge — java tutorial
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.net.URI;
import java.nio.file.Paths;
import java.nio.file.Files;

public class HeapDumpTrigger {
    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpRequest req = HttpRequest.newBuilder()
            .uri(URI.create("http://localhost:8080/actuator/heapdump"))
            .GET().build();
        HttpResponse<byte[]> resp = client.send(req,
            HttpResponse.BodyHandlers.ofByteArray());
        Files.write(Paths.get("/tmp/heapdump-" + System.currentTimeMillis() + ".hprof"),
            resp.body());
        System.out.println("Heap dump saved: " + resp.headers().firstValue("Content-Length").orElse("N/A"));
    }
}
Output
Heap dump saved: 1.2GB
Production Trap:
Heap dumps are huge and block the JVM during generation. Never enable it on a latency-sensitive endpoint. Use a separate management port and trigger it only when the instance is already failing.
Key Takeaway
Take a healthy baseline heap dump before an incident. Compare the two to find memory leaks in minutes, not hours.



    ● Production incidentPOST-MORTEMseverity: high
The Silent Connection Pool Leak That Took Down Payments
Symptom
Payment processing slowed to a crawl. Intermittent timeouts. Alerts showed Nginx 502s and HikariPool exceptions. CPU and memory looked fine. Datadog showed connection pool saturation, then exhaustion.
Assumption
The database is overloaded. Add more replicas. Increase pool size. Call the DBA at 3 AM.
Root cause
A thread-local connection leak. A developer had opened a DataSourceUtils.getConnection() inside a loop without closing it in a finally block. Intermittent exceptions in the loop skipped the close. The thread-local held the connection, so HikariCP's eviction thread never saw it. After ~500 requests, the pool had 10 leaked connections. After 1000, it was dead.
Fix
1. Deploy a rolling restart to clear leaked connections. 2. Add @Transactional on the service method so Spring manages the connection lifecycle. 3. Replace the raw JDBC code with Spring's JdbcTemplate which automatically closes connections. 4. Set spring.datasource.hikari.leak-detection-threshold=60000 to alert on any connection held >60s.
Key lesson
If you manually open a JDBC connection in Spring Boot, you're writing buggy code.
Use JdbcTemplate or @Transactional.
Always.
There is no exception.

    Production debug guideSymptom → root cause → fix for the failures that actually happen4 entries
Symptom · 01
HTTP requests hang, then timeout after 30s. HikariCP logs: 'Connection is not available, request timed out after 30000ms'.
→
Fix
Don't restart. Run jstack <pid> | grep -A 20 'HikariPool' to see which threads are waiting. Next, check the database with SELECT * FROM pg_stat_activity WHERE state = 'active' for long-running queries or locks. Finally, enable HikariCP leak detection with spring.datasource.hikari.leak-detection-threshold=30000 in your next deployment.
Symptom · 02
OutOfMemoryError: Java heap space with no obvious memory leak in the code.
→
Fix
Immediately add -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/tmp/heap.hprof to JVM args if not already present. Use jcmd <pid> GC.heap_dump /tmp/heap.hprof now. Analyze with Eclipse MAT. Look for byte[] instances held by ThreadLocal or HttpClient response bodies. 90% of the time, it's a stream that wasn't consumed, or a cache without eviction.
Symptom · 03
DB queries run fine in a script but are slow from Spring Boot.
→
Fix
Enable Spring's transaction logging: logging.level.org.springframework.transaction.interceptor=TRACE. Check if your @Transactional(readOnly = true) is actually using a read-only connection — if your DB driver doesn't support it, it's ignored. Profile with spring.jpa.properties.hibernate.generate_statistics=true. Look for N+1 queries printed in logs.
Symptom · 04
Pod restarts without any Java error, CPU spikes to 100% before crash.
→
Fix
This is usually a native memory leak. JVM heap looks fine, but native code (e.g., netty direct buffers, JNI) is leaking off-heap. Check /proc/<pid>/status for VmRSS. Add -XX:NativeMemoryTracking=detail then jcmd <pid> VM.native_memory summary. Common culprits: Spring Boot with Netty, gRPC, or image processing libraries.

    ★ Debug Cheat SheetCommands for fast diagnosis in production
All requests hang → pool exhaustion−
Immediate action
Thread dump. Look for threads waiting on a connection.
Commands
jstack <pid> | grep -A 30 'HikariPool'
SELECT * FROM pg_locks WHERE NOT granted;
Fix now
Rollback deployment or restart. Add leak-detection-threshold=30000.
OOM without obvious heavy object creation+
Immediate action
Heap dump immediately before restart.
Commands
jcmd <pid> GC.heap_dump /tmp/heap.hprof
jcmd <pid> VM.native_memory summary
Fix now
Add -XX:+HeapDumpOnOutOfMemoryError and analyze with MAT.
Slow responses, high RT, low throughput+
Immediate action
Enable Hibernate stats on the fly via Actuator.
Commands
curl -X POST localhost:8081/actuator/loggers/org.hibernate.stat -H 'Content-Type: application/json' -d '{"configuredLevel":"TRACE"}'
tail -f /var/log/app/spring.log | grep 'hibernate.stat'
Fix now
Add spring.jpa.properties.hibernate.generate_statistics=true to config and restart.


    Cache Strategies in Production
Cache Type Spring Annotation Eviction Strategy Best For
In-Memory (Caffeine) @Cacheable + @CacheConfig(cacheNames="...") TTL via spring.cache.caffeine.spec=expireAfterWrite=10m Small, fast-growing caches like user sessions
Redis @Cacheable with RedisCacheManager TTL via RedisTimeToLive; manual @CacheEvict Shared caches across instances, high throughput
Distributed (Hazelcast) @Cacheable with Hazelcast TTL + max-size policy Large, clustered caches needing near-cache speed
Database (JPA 2nd level) @Cacheable with Hibernate Region-based eviction Entity caching that stays consistent with DB writes

    ⚙ Quick Reference
2 commands from this guide
File Command / Code Purpose
ThreadDumpCapture.java public class ThreadDumpCapture { jstack Is Not a Luxury
HeapDumpTrigger.java public class HeapDumpTrigger { The /heapdump Endpoint

    Key takeaways
1Connection pool exhaustion is almost never about the pool size. It's about queries or transactions holding connections too long. Find the slow query first.
2Thread dumps are free and fast. Take three before you touch anything. They show you the real state of the JVM.
3Heap dumps don't lie. Eclipse MAT's 'Leak Suspects' report finds the dominating memory consumer in minutes. Don't guess.
4Narrow your @Transactional boundaries. Never make an HTTP call inside a transaction
you're holding a connection hostage.
5Enable Hibernate statistics and Actuator endpoints on day one. Configure leak-detection-threshold and probes.enabled=true before you need them.

    Common mistakes to avoid
5 patterns
×Using raw JDBC Connection inside a Spring Boot service without closing in finally block
Symptom
HikariPool connection exhaustion after a few hundred requests
Fix
Replace with JdbcTemplate which auto-closes connections, or add @Transactional to the method and inject EntityManager instead.
×Setting spring.datasource.hikari.maximum-pool-size too high (e.g. 200)
Symptom
Database CPU spikes, slow queries, connection storms on DB restart
Fix
Set based on DB core count × 2 + 1 formula. For a typical app with fast queries: 10-20 per instance is enough. Increase sequentially only after proving the bottleneck is pool size.
×Not enabling Hibernate statistics before an incident
Symptom
Cannot trace slow queries or N+1 problems in production
Fix
Add spring.jpa.properties.hibernate.generate_statistics=true to application.yml. Turn it on now. Restart. You'll get detailed query logs.
×Calling a @Transactional method from within the same class (self-invocation)
Symptom
Transaction boundary ignored — partial DB writes, no rollback on exception
Fix
Inject the service into itself (circular dependency) or call the method from a different service. Better: extract transactional logic into a separate DAO/service class.
×Exposing Actuator endpoints without authentication
Symptom
Any external attacker can change log levels, trigger shutdown, or see heap dumps
Fix
Set spring.security.user.name=admin and spring.security.user.password=${ACTUATOR_PASSWORD}. Use a secrets manager for the password. Or lock down via network policies.

    

    

  
    
      INTERVIEW PREP · PRACTICE MODE
      Interview Questions on This Topic
    
    
      
      
    
  

  
  
    Q01SENIOR
A Spring Boot service starts returning 503 errors after running for 12 h...
Q02SENIOR
What's the difference between `@Transactional(propagation = REQUIRES_NEW...
Q03SENIOR
Your Spring Boot app uses HikariCP with maximum-pool-size=10. During a l...
Q04SENIOR
You have a memory leak in a Spring Boot app. The heap dump shows a `Conc...
Q05JUNIOR
How do you hot-change a log level in a running Spring Boot application w...
Q06JUNIOR
What is the difference between `@RestControllerAdvice` and `@ControllerA...
Q07SENIOR
Explain how a thread dump can help debug a Java deadlock. What patterns ...
Q08SENIOR
Your Spring Boot app has a scheduled task (`@Scheduled`) that runs every...
    Q01 of 08SENIOR
A Spring Boot service starts returning 503 errors after running for 12 hours. Thread dumps show many threads in BLOCKED state on a ReentrantLock inside HikariPool. What do you check first?
ANSWER
I check the database for long-running queries or deadlocks. The pool lock is not the root cause — it's a consequence. Use SELECT * FROM pg_stat_activity WHERE state != 'idle' to find queries running longer than 30 seconds. Then check if any @Transactional method is making a slow HTTP call while holding a connection.
Q02 of 08SENIOR
What's the difference between `@Transactional(propagation = REQUIRES_NEW)` and `@Transactional(propagation = NESTED)` in Spring? When would you use each?
ANSWER
REQUIRES_NEW suspends the outer transaction and starts a new one; the inner commit happens independently of the outer. NESTED uses a savepoint — the inner can roll back without affecting the outer, but both commit together. Use REQUIRES_NEW for a sub-operation that must be persisted regardless of the outer transaction's outcome (e.g., audit log). Use NESTED for error-handling where you want to roll back a sub-operation but not the entire request.
Q03 of 08SENIOR
Your Spring Boot app uses HikariCP with maximum-pool-size=10. During a load test, you see 'Connection is not available' after 30 seconds. You double the pool to 20 and the problem worsens. Why?
ANSWER
The real issue is likely a long-running query or a transaction held open too long. Adding more connections just increases the number of concurrent slow operations, leading to more database contention and slower response times. I would profile the database first — check pg_stat_activity for active queries, enable Hibernate stats to find slow queries, and ensure @Transactional scopes are narrow.
Q04 of 08SENIOR
You have a memory leak in a Spring Boot app. The heap dump shows a `ConcurrentHashMap` with 200 million `String` entries. The map is held by a `@Service` bean. The keys are UUIDs. What do you infer?
ANSWER
The map is being used as a cache or accumulator, but there's no eviction — the keys are unique per request (UUIDs), so the map grows indefinitely. Fix: Add a TTL or LRU eviction with Caffeine, or use Spring's @Cacheable with expireAfterWrite. Also check that the @Service is not @SessionScoped accidentally.
Q05 of 08JUNIOR
How do you hot-change a log level in a running Spring Boot application without restarting?
ANSWER
Use the Actuator /loggers endpoint. POST to /actuator/loggers/com.example.myservice with a JSON body {"configuredLevel": "TRACE"}. Make sure you have authentication on the endpoint. This can be done via curl or a shell script during an incident.
Q06 of 08JUNIOR
What is the difference between `@RestControllerAdvice` and `@ControllerAdvice` in Spring Boot?
ANSWER
@RestControllerAdvice is a convenience annotation combining @ControllerAdvice and @ResponseBody. It's used for REST APIs where you want exception handlers to return JSON or XML bodies. @ControllerAdvice without @ResponseBody would return a view name. In modern Spring Boot, @RestControllerAdvice is preferred for APIs.
Q07 of 08SENIOR
Explain how a thread dump can help debug a Java deadlock. What patterns do you look for?
ANSWER
A thread dump from a deadlocked JVM will show at least two threads in BLOCKED state, each waiting on a monitor that the other holds. Look for threads in BLOCKED on synchronized blocks or ReentrantLock. Each thread's stack trace will show the monitor it's waiting for (- waiting to lock <0x...>) and the monitor it holds (- locked <0x...>). You need at least two dumps to confirm the pattern is stable.
Q08 of 08SENIOR
Your Spring Boot app has a scheduled task (`@Scheduled`) that runs every minute. Over weeks, response times degrade. What do you suspect?
ANSWER
The scheduled task might be a memory leak or a resource leak. Check if the task accumulates data in a collection or opens streams/files without closing them. Also verify the thread pool for @Scheduled — by default, it's a single thread, so if the task takes longer than 1 minute, it backs up. Fix: add spring.task.scheduling.pool.size=5 to allow concurrent scheduled tasks.
  

  
  
    01
A Spring Boot service starts returning 503 errors after running for 12 hours. Thread dumps show many threads in BLOCKED state on a ReentrantLock inside HikariPool. What do you check first?
SENIOR
02
What's the difference between `@Transactional(propagation = REQUIRES_NEW)` and `@Transactional(propagation = NESTED)` in Spring? When would you use each?
SENIOR
03
Your Spring Boot app uses HikariCP with maximum-pool-size=10. During a load test, you see 'Connection is not available' after 30 seconds. You double the pool to 20 and the problem worsens. Why?
SENIOR
04
You have a memory leak in a Spring Boot app. The heap dump shows a `ConcurrentHashMap` with 200 million `String` entries. The map is held by a `@Service` bean. The keys are UUIDs. What do you infer?
SENIOR
05
How do you hot-change a log level in a running Spring Boot application without restarting?
JUNIOR
06
What is the difference between `@RestControllerAdvice` and `@ControllerAdvice` in Spring Boot?
JUNIOR
07
Explain how a thread dump can help debug a Java deadlock. What patterns do you look for?
SENIOR
08
Your Spring Boot app has a scheduled task (`@Scheduled`) that runs every minute. Over weeks, response times degrade. What do you suspect?
SENIOR
  



    

  
    FAQ · 5 QUESTIONS
    Frequently Asked Questions
  
  01
How do I get a thread dump from a Kubernetes pod running Spring Boot?
Run kubectl exec <pod-name> -- jstack 1 > thread_dump.txt. The Java process is usually PID 1 inside a container. If jstack isn't available, use kubectl exec to a sidecar container with the JDK, or enable Actuator's /actuator/threaddump endpoint and curl it.
Was this helpful? 
02
What's the first thing to check when HikariCP says 'Connection is not available'?
Check the database for active queries: SELECT  FROM pg_stat_activity WHERE state = 'active'. If you see long-running queries, find their origin. Check for deadlocks with SELECT  FROM pg_locks WHERE NOT granted. Then check your @Transactional scopes — are any methods making slow HTTP calls inside a transaction?
Was this helpful? 
03
Should I use `@EnableTransactionManagement` in Spring Boot?
No, it's automatically enabled in Spring Boot when you have spring-tx on the classpath. Adding it manually can cause issues if you set mode = AspectJ incorrectly. Leave it to auto-configuration.
Was this helpful? 
04
What is the difference between Hibernate's `org.hibernate.stat.Statistics` and `spring.jpa.properties.hibernate.generate_statistics`?
The property hibernate.generate_statistics=true enables Hibernate's internal statistics collection. You can then access them via the Statistics MBean or in logs. It logs every SQL statement, its execution time, and the number of rows fetched. It's the easiest way to find N+1 queries in production.
Was this helpful? 
05
Why did my Spring Boot app crash with 'Unable to create native thread' even though heap usage was low?
This is an OS-level thread limit, not a JVM heap issue. The JVM creates one thread per HTTP request (Tomcat default). If you have 1000 concurrent requests and the OS thread limit is 1000 (common in containers), you hit it. Check /proc/sys/kernel/threads-max and your container's resource limits. Fix: increase container CPU/memory resources or reduce Tomcat's server.tomcat.threads.max.
Was this helpful? 



    📖 What to Read Next
→Java Memory Leaks: The Production Field GuideDeep dive on identifying and fixing memory leaks in real Java applications, with heap dump analysis patterns.→Spring Boot Actuator: The Security Risks You Didn't Know AboutWhy exposing Actuator without auth is a security vulnerability, and how to properly secure it.→Transactional Misconceptions in Spring BootCommon `@Transactional` mistakes, including self-invocation, propagation misconfigurations, and cross-service calls.
From Other Categories
Doubly Linked ListIntermediate
    

    
    
      
        N
        
          
            Naren
            Founder & Principal Engineer
          
          
            20+ years shipping production Java in banking & fintech. Everything here is grounded in real deployments.
          
        
        
          
          Follow
        
      
      
        
          
            ✓ Verified
          
          production tested
        
        
          July 04, 2026
          last updated
        
        
          1,697
          articles · all by Naren
        
      
    

    
    
      🔥
      
        That's Production. Mark it forged?
      
      
        6 min read · try the examples if you haven't

Cache Type	Spring Annotation	Eviction Strategy	Best For
In-Memory (Caffeine)	@Cacheable + @CacheConfig(cacheNames="...")	TTL via spring.cache.caffeine.spec=expireAfterWrite=10m	Small, fast-growing caches like user sessions
Redis	@Cacheable with RedisCacheManager	TTL via RedisTimeToLive; manual @CacheEvict	Shared caches across instances, high throughput
Distributed (Hazelcast)	@Cacheable with Hazelcast	TTL + max-size policy	Large, clustered caches needing near-cache speed
Database (JPA 2nd level)	@Cacheable with Hibernate	Region-based eviction	Entity caching that stays consistent with DB writes

File	Command / Code	Purpose
ThreadDumpCapture.java	public class ThreadDumpCapture {	jstack Is Not a Luxury
HeapDumpTrigger.java	public class HeapDumpTrigger {	The /heapdump Endpoint