正在切换页面...

LeakCanary: Core Principles and Source Code Deep Dive

hardLeakCanaryMemory LeakPerformanceSource CodeUpdated

In a previous article, we explored the root causes of Android memory leaks and strategies for avoiding common leak patterns. However, as business logic proliferates, manually tracking down leaks becomes practically impossible.

The industry-standard automated detection tool is Square's open-source LeakCanary. It can pinpoint the exact line of code responsible for a leak. This article dissects the source code of LeakCanary 2.x, revealing precisely how it detects leaks "under the hood."

LeakCanary in Practice

Before diving into the internals, let's trace the developer's actual workflow with LeakCanary.

1. Zero-Config Integration

LeakCanary 2.x is the paragon of non-invasive integration. No initialization code in Application is required — just a single Gradle dependency:

dependencies {
  // debug-only: automatically stripped from release builds, zero production overhead
  debugImplementation 'com.squareup.leakcanary:leakcanary-android:2.12'
}

2. Automated Detection and Reporting

After integration, you use the app normally. When you repeatedly enter and exit a leaking Activity/Fragment, LeakCanary silently records the activity. By default, once 5 unreleased leaked objects accumulate:

Brief Freeze: The app pauses for ~1-2 seconds while the VM captures a heap snapshot (Heap Dump).
Notification: A canary bird icon appears in the notification tray.
Report Generation: Tapping the notification opens LeakCanary's built-in visualization UI (the Leaks App).

3. Reading the Leak Trace

The Leaks App displays a Leak Trace — a tree-like reference chain. A typical report:

┬───
│ GC Root: Local variable in thread main
│
├─ android.os.MessageQueue
│    Leaking: NO (MessageQueue is part of the GC Root chain)
│
├─ com.example.MyHandler
│    Leaking: UNKNOWN
│    ↓ MyHandler.mContext
│
╰→ com.example.MainActivity
     Leaking: YES (Activity#mDestroyed is true)

Reading Rule: Trace upward from the bottom, looking for the first node causing Leaking: YES.

Leaking: YES: The object's lifecycle has ended (e.g., mDestroyed = true), but it's still in memory — this is the victim.
Leaking: NO: This object is legitimately alive (e.g., the main thread's MessageQueue).
Leaking: UNKNOWN: Objects in the grey zone.

Case Analysis: MainActivity can't be released because MyHandler holds it via mContext, and MyHandler is held by a pending MessageQueue message. Solution: Convert the Handler to a static inner class with a WeakReference.

4. Monitoring Custom Objects

Beyond automatic monitoring of system components (Activity, Fragment, View, ViewModel), you can manually watch any business object:

// Call when you expect myPresenter to be garbage collected:
AppWatcher.objectWatcher.watch(myPresenter, "MyPresenter should be destroyed")

Core Mechanism: WeakReference + ReferenceQueue

How does LeakCanary know whether an object has been collected? The foundation is WeakReference combined with ReferenceQueue.

The Hotel Checkout Analogy

Imagine a hotel where an Activity is a guest.

When the guest checks out at the front desk (onDestroy is called), they should leave.
The security guard (LeakCanary) attaches a tracking tag (WeakReference) to the guest.
There's a checkout ledger (ReferenceQueue) at the exit. When a tagged guest walks out (is GC'd), the tag automatically falls off and lands in the ledger.
After a delay (e.g., 5 seconds), the guard checks the ledger:
- Tag found in the ledger → Guest left. No leak.
- Tag NOT found, and the tag still exists (WeakReference.get() != null) → Guest checked out but is still wandering the building. Memory leak confirmed!

JVM Mechanism

When the GC collects a weakly-referenced object, the JVM automatically enqueues the WeakReference into its associated ReferenceQueue. LeakCanary leverages this spec-guaranteed behavior: monitoring the ReferenceQueue reveals whether an object was truly collected.

Source Code Deep Dive: How LeakCanary Operates

The complete pipeline consists of four stages: Auto-Initialization → Lifecycle Listening → Leak Detection → Heap Dump & Analysis.

1. Non-Invasive Init: The ContentProvider Trick

How does LeakCanary 2.x initialize without any code in Application? It exploits the fact that ContentProvider.onCreate() executes between Application.attachBaseContext() and Application.onCreate().

// Inside leakcanary-object-watcher-android
internal class AppWatcherInstaller : ContentProvider() {
    override fun onCreate(): Boolean {
        val application = context!!.applicationContext as Application
        // Bootstraps the monitoring system automatically
        AppWatcher.manualInstall(application)
        return true
    }
    // query, insert, etc. all return null/empty
}

When the dependency is included, the manifest merger injects this invisible ContentProvider. The system instantiates it at boot, silently starting the monitoring.

2. Lifecycle Listening: System-Level Callbacks

After initialization, LeakCanary needs to know "when the guest checks out." It registers global lifecycle callbacks:

internal class ActivityDestroyWatcher private constructor(
    private val objectWatcher: ObjectWatcher
) {
    private val lifecycleCallbacks = object : Application.ActivityLifecycleCallbacks by noOpDelegate() {
        override fun onActivityDestroyed(activity: Activity) {
            // When an Activity is destroyed, hand it to ObjectWatcher for tracking
            objectWatcher.watch(
                activity,
                "${activity::class.java.name} received Activity#onDestroy() callback"
            )
        }
    }
}

For Fragments, it uses FragmentManager.registerFragmentLifecycleCallbacks, monitoring onFragmentViewDestroyed and onFragmentDestroyed.

3. Leak Detection: ObjectWatcher and the 5-Second Poll

When an object is destroyed, it's passed to ObjectWatcher.watch() — the heart of the detection mechanism.

Step 1: Attach the Tracking Tag

@Synchronized fun watch(watchedObject: Any, description: String) {
    // 1. Generate a unique UUID identifier
    val key = UUID.randomUUID().toString()
    
    // 2. Wrap in a KeyedWeakReference, associated with the ReferenceQueue
    val reference = KeyedWeakReference(watchedObject, key, description, watchUptimeMillis, queue)
    
    // 3. Store in the retainedReferences map (tracking begins)
    retainedReferences[key] = reference
    
    // 4. Schedule a delayed check (default: 5 seconds later)
    checkRetainedExecutor.execute {
        moveToRetained(key)
    }
}

Step 2: Drain the Checkout Ledger (Core Logic) Before making a leak determination, clean out objects already collected:

private fun removeWeaklyReachableObjects() {
    var ref: KeyedWeakReference?
    // Poll the ReferenceQueue for WeakReferences that the GC has enqueued
    while (queue.poll().also { ref = it as KeyedWeakReference? } != null) {
        // Found in ReferenceQueue = object was collected. Remove from tracking map.
        retainedReferences.remove(ref!!.key)
    }
}

Step 3: Confirm the Leak After 5 seconds, moveToRetained executes. If, after removeWeaklyReachableObjects() drains the queue, the UUID key is still present in retainedReferences, it means the object should be dead but isn't. Leak confirmed. The internal leak counter increments.

4. Heap Dump and the Shark Engine

When the accumulated unreleased leak count exceeds the threshold (default: 5), the final phase triggers:

Heap Dump: Calls Debug.dumpHprofData(path) to export the entire Java heap as a binary .hprof file.
Shark Analysis: LeakCanary 2.x replaced the bloated HAHA library with its in-house Shark engine. Shark uses Kotlin Sequence lazy evaluation to parse multi-megabyte binary files under extremely tight memory constraints.
BFS Graph Traversal: Shark's algorithm is fundamentally a graph search. Starting from all GC Roots (active threads, static variables), it expands outward layer by layer (Breadth-First Search) until it locates the instance tagged with our UUID.
Shortest Path Generation: Because BFS inherently finds the shortest strong reference path, the first path discovered is the most direct chain from GC Root to the leaked object — this becomes the Leak Trace.

Design Tradeoffs: Why This Architecture?

1. Why Accumulate 5 Leaks Before Dumping?

Debug.dumpHprofData() is an extremely heavy operation. During the dump, the ART runtime must suspend all application threads (Suspend All). Dumping on every single detected leak would cause constant freezes during testing. Batching at 5 is a deliberate balance between detection responsiveness and developer experience.

2. Why Not Use `finalize()` to Detect Collection?

The finalize() mechanism in Java is notoriously unreliable: it can cause "object resurrection," increases GC pause times, and its execution timing is non-deterministic. WeakReference + ReferenceQueue is the JVM-standard, lightweight, safe, and non-invasive detection paradigm.

3. Why Shark Over Eclipse MAT?

MAT analyzes the complete heap — every object and every reference — consuming enormous memory. Shark's objective is surgically narrow: find the shortest path between a specific leaked object and a GC Root. It can discard ~90% of the heap data during parsing, dramatically reducing peak memory usage and making on-device analysis feasible.

Summary

In practice, LeakCanary operates like a top-tier private detective embedded within your Android process:

Infiltrates silently via ContentProvider.
Monitors every departing "guest" through LifecycleCallbacks.
Attaches WeakReference trackers and verifies departure via ReferenceQueue.
When a guest goes missing, captures the entire crime scene with Debug.dumpHprofData.
Finally, uses Shark's BFS traversal to uncover the exact reference chain holding the victim hostage.

Understanding LeakCanary is not just about learning a debugging tool — its "non-invasive hook architecture" and "reference queue detection paradigm" constitute a masterclass in systems-level design.

LeakCanary: Core Principles and Source Code Deep Dive

hardLeakCanaryMemory LeakPerformanceSource CodeUpdated

LeakCanary in Practice

Before diving into the internals, let's trace the developer's actual workflow with LeakCanary.

1. Zero-Config Integration

LeakCanary 2.x is the paragon of non-invasive integration. No initialization code in Application is required — just a single Gradle dependency:

dependencies {
  // debug-only: automatically stripped from release builds, zero production overhead
  debugImplementation 'com.squareup.leakcanary:leakcanary-android:2.12'
}

2. Automated Detection and Reporting

Brief Freeze: The app pauses for ~1-2 seconds while the VM captures a heap snapshot (Heap Dump).
Notification: A canary bird icon appears in the notification tray.
Report Generation: Tapping the notification opens LeakCanary's built-in visualization UI (the Leaks App).

3. Reading the Leak Trace

The Leaks App displays a Leak Trace — a tree-like reference chain. A typical report:

┬───
│ GC Root: Local variable in thread main
│
├─ android.os.MessageQueue
│    Leaking: NO (MessageQueue is part of the GC Root chain)
│
├─ com.example.MyHandler
│    Leaking: UNKNOWN
│    ↓ MyHandler.mContext
│
╰→ com.example.MainActivity
     Leaking: YES (Activity#mDestroyed is true)

Reading Rule: Trace upward from the bottom, looking for the first node causing Leaking: YES.

Leaking: YES: The object's lifecycle has ended (e.g., mDestroyed = true), but it's still in memory — this is the victim.
Leaking: NO: This object is legitimately alive (e.g., the main thread's MessageQueue).
Leaking: UNKNOWN: Objects in the grey zone.

4. Monitoring Custom Objects

Beyond automatic monitoring of system components (Activity, Fragment, View, ViewModel), you can manually watch any business object:

// Call when you expect myPresenter to be garbage collected:
AppWatcher.objectWatcher.watch(myPresenter, "MyPresenter should be destroyed")

Core Mechanism: WeakReference + ReferenceQueue

How does LeakCanary know whether an object has been collected? The foundation is WeakReference combined with ReferenceQueue.

The Hotel Checkout Analogy

Imagine a hotel where an Activity is a guest.

When the guest checks out at the front desk (onDestroy is called), they should leave.
The security guard (LeakCanary) attaches a tracking tag (WeakReference) to the guest.
There's a checkout ledger (ReferenceQueue) at the exit. When a tagged guest walks out (is GC'd), the tag automatically falls off and lands in the ledger.
After a delay (e.g., 5 seconds), the guard checks the ledger:
- Tag found in the ledger → Guest left. No leak.
- Tag NOT found, and the tag still exists (WeakReference.get() != null) → Guest checked out but is still wandering the building. Memory leak confirmed!

JVM Mechanism

Source Code Deep Dive: How LeakCanary Operates

The complete pipeline consists of four stages: Auto-Initialization → Lifecycle Listening → Leak Detection → Heap Dump & Analysis.

1. Non-Invasive Init: The ContentProvider Trick

// Inside leakcanary-object-watcher-android
internal class AppWatcherInstaller : ContentProvider() {
    override fun onCreate(): Boolean {
        val application = context!!.applicationContext as Application
        // Bootstraps the monitoring system automatically
        AppWatcher.manualInstall(application)
        return true
    }
    // query, insert, etc. all return null/empty
}

When the dependency is included, the manifest merger injects this invisible ContentProvider. The system instantiates it at boot, silently starting the monitoring.

2. Lifecycle Listening: System-Level Callbacks

After initialization, LeakCanary needs to know "when the guest checks out." It registers global lifecycle callbacks:

internal class ActivityDestroyWatcher private constructor(
    private val objectWatcher: ObjectWatcher
) {
    private val lifecycleCallbacks = object : Application.ActivityLifecycleCallbacks by noOpDelegate() {
        override fun onActivityDestroyed(activity: Activity) {
            // When an Activity is destroyed, hand it to ObjectWatcher for tracking
            objectWatcher.watch(
                activity,
                "${activity::class.java.name} received Activity#onDestroy() callback"
            )
        }
    }
}

For Fragments, it uses FragmentManager.registerFragmentLifecycleCallbacks, monitoring onFragmentViewDestroyed and onFragmentDestroyed.

3. Leak Detection: ObjectWatcher and the 5-Second Poll

When an object is destroyed, it's passed to ObjectWatcher.watch() — the heart of the detection mechanism.

Step 1: Attach the Tracking Tag

@Synchronized fun watch(watchedObject: Any, description: String) {
    // 1. Generate a unique UUID identifier
    val key = UUID.randomUUID().toString()
    
    // 2. Wrap in a KeyedWeakReference, associated with the ReferenceQueue
    val reference = KeyedWeakReference(watchedObject, key, description, watchUptimeMillis, queue)
    
    // 3. Store in the retainedReferences map (tracking begins)
    retainedReferences[key] = reference
    
    // 4. Schedule a delayed check (default: 5 seconds later)
    checkRetainedExecutor.execute {
        moveToRetained(key)
    }
}

Step 2: Drain the Checkout Ledger (Core Logic) Before making a leak determination, clean out objects already collected:

private fun removeWeaklyReachableObjects() {
    var ref: KeyedWeakReference?
    // Poll the ReferenceQueue for WeakReferences that the GC has enqueued
    while (queue.poll().also { ref = it as KeyedWeakReference? } != null) {
        // Found in ReferenceQueue = object was collected. Remove from tracking map.
        retainedReferences.remove(ref!!.key)
    }
}

4. Heap Dump and the Shark Engine

When the accumulated unreleased leak count exceeds the threshold (default: 5), the final phase triggers:

Heap Dump: Calls Debug.dumpHprofData(path) to export the entire Java heap as a binary .hprof file.
Shark Analysis: LeakCanary 2.x replaced the bloated HAHA library with its in-house Shark engine. Shark uses Kotlin Sequence lazy evaluation to parse multi-megabyte binary files under extremely tight memory constraints.
BFS Graph Traversal: Shark's algorithm is fundamentally a graph search. Starting from all GC Roots (active threads, static variables), it expands outward layer by layer (Breadth-First Search) until it locates the instance tagged with our UUID.
Shortest Path Generation: Because BFS inherently finds the shortest strong reference path, the first path discovered is the most direct chain from GC Root to the leaked object — this becomes the Leak Trace.

Design Tradeoffs: Why This Architecture?

1. Why Accumulate 5 Leaks Before Dumping?

2. Why Not Use `finalize()` to Detect Collection?

3. Why Shark Over Eclipse MAT?

Summary

In practice, LeakCanary operates like a top-tier private detective embedded within your Android process:

Infiltrates silently via ContentProvider.
Monitors every departing "guest" through LifecycleCallbacks.
Attaches WeakReference trackers and verifies departure via ReferenceQueue.
When a guest goes missing, captures the entire crime scene with Debug.dumpHprofData.
Finally, uses Shark's BFS traversal to uncover the exact reference chain holding the victim hostage.

LeakCanary in Practice

1. Zero-Config Integration

2. Automated Detection and Reporting

3. Reading the Leak Trace

4. Monitoring Custom Objects

Core Mechanism: WeakReference + ReferenceQueue

The Hotel Checkout Analogy

JVM Mechanism

Source Code Deep Dive: How LeakCanary Operates

1. Non-Invasive Init: The ContentProvider Trick

2. Lifecycle Listening: System-Level Callbacks

3. Leak Detection: ObjectWatcher and the 5-Second Poll

4. Heap Dump and the Shark Engine

Design Tradeoffs: Why This Architecture?

1. Why Accumulate 5 Leaks Before Dumping?

2. Why Not Use finalize() to Detect Collection?

3. Why Shark Over Eclipse MAT?

Summary

LeakCanary in Practice

1. Zero-Config Integration

2. Automated Detection and Reporting

3. Reading the Leak Trace

4. Monitoring Custom Objects

Core Mechanism: WeakReference + ReferenceQueue

The Hotel Checkout Analogy

JVM Mechanism

Source Code Deep Dive: How LeakCanary Operates

1. Non-Invasive Init: The ContentProvider Trick

2. Lifecycle Listening: System-Level Callbacks

3. Leak Detection: ObjectWatcher and the 5-Second Poll

4. Heap Dump and the Shark Engine

Design Tradeoffs: Why This Architecture?

1. Why Accumulate 5 Leaks Before Dumping?

2. Why Not Use finalize() to Detect Collection?

3. Why Shark Over Eclipse MAT?

Summary

2. Why Not Use `finalize()` to Detect Collection?

2. Why Not Use `finalize()` to Detect Collection?