想必做过Android开发的童鞋都遇到过anr(Application Not Responding)。一般的做法就是去/data/anr/trace.txt取对应的 文件。然后,再从这一堆庞大的堆栈信息找到自己想要的结果。但是拿到一堆arn堆栈信息很抓狂,如何从这庞大的堆栈信息找到自己的问题代码呢?这篇文章,给大家分享下如何取定位anr问题。
首先看下堆栈信息
07-26 17:51:04.442: E/ActivityManager(2258): ANR in com.pafinancialtech.zijinbank (com.pafinancialtech.zijinbank/com.pingan.fstandard.paff.activity.MainActivity)07-26 17:51:04.442: E/ActivityManager(2258): PID: 2856007-26 17:51:04.442: E/ActivityManager(2258): Reason: Input dispatching timed out (Waiting to send non-key event because the touched window has not finished processing certain input events that were delivered to it over 500.0ms ago. Wait queue length: 11. Wait queue head age: 5573.8ms.)07-26 17:51:04.442: E/ActivityManager(2258): Load: 5.15 / 3.68 / 3.3807-26 17:51:04.442: E/ActivityManager(2258): CPU usage from 602ms to -6091ms ago:07-26 17:51:04.442: E/ActivityManager(2258): 65% 2258/system_server: 14% user + 50% kernel / faults: 9262 minor07-26 17:51:04.442: E/ActivityManager(2258): 47% 28560/com.pafinancialtech.zijinbank: 10% user + 37% kernel / faults: 4197 minor07-26 17:51:04.442: E/ActivityManager(2258): 11% 4292/com.android.systemui: 5.9% user + 5.6% kernel / faults: 2762 minor07-26 17:51:04.442: E/ActivityManager(2258): 9.2% 4819/com.android.phone: 3.1% user + 6.1% kernel / faults: 3972 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.9% 4774/com.vivo.daemonService: 0.6% user + 1.3% kernel / faults: 3167 minor07-26 17:51:04.442: E/ActivityManager(2258): 3.5% 4618/com.baidu.map.location: 1.7% user + 1.7% kernel / faults: 3635 minor07-26 17:51:04.442: E/ActivityManager(2258): 3.5% 28690/com.pafinancialtech.zijinbank:remote: 2.3% user + 1.1% kernel / faults: 2368 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 4580/debuggerd64: 0% user + 0% kernel / faults: 2492 minor07-26 17:51:04.442: E/ActivityManager(2258): 0.4% 4842/com.vivo.abe: 0.3% user + 0.1% kernel / faults: 1455 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 4705/com.android.server.telecom: 0% user + 0% kernel / faults: 1677 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 4867/com.vivo.fingerprint.fpc.service: 0% user + 0% kernel / faults: 1456 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 4735/com.qualcomm.qti.services.secureui:sui_service: 0% user + 0% kernel / faults: 1503 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.3% 4165/adbd: 0.1% user + 1.1% kernel / faults: 72 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 28661/debuggerd: 0% user + 0% kernel / faults: 820 minor07-26 17:51:04.442: E/ActivityManager(2258): 0.5% 24757/logcat: 0.1% user + 0.4% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.5% 28524/kworker/u16:3: 0% user + 0.5% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.5% 28909/kworker/u16:4: 0% user + 0.5% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.4% 181/cfinteractive: 0% user + 0.4% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.4% 26204/logcat: 0.4% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 4215/VosMCThread: 0% user + 0.1% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 4217/VosRXThread: 0% user + 0.1% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 5999/core_ctl/0: 0% user + 0.1% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.2% 28467/irq/13-synaptic: 0% user + 0.2% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 3/ksoftirqd/0: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 6/kworker/u16:0: 0% user + 0.1% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 7/migration/0: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 8/rcu_preempt: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 19/migration/3: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 20/ksoftirqd/3: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 35/kworker/u17:0: 0% user + 0.1% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 207/mmcqd/0: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% 296/surfaceflinger: 0.1% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 611/kworker/3:2: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 4216/VosTXThread: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 5985/migration/5: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0% 6033/perfd: 0% user + 0% kernel / faults: 4 minor07-26 17:51:04.442: E/ActivityManager(2258): 0% 27164/kworker/6:0: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.-4% TOTAL: 0% user + 0.-1% kernel + 0.-3% iowait07-26 17:51:04.442: E/ActivityManager(2258): CPU usage from 5196ms to 5716ms later:07-26 17:51:04.442: E/ActivityManager(2258): 23% 4292/com.android.systemui: 13% user + 9.6% kernel / faults: 305 minor07-26 17:51:04.442: E/ActivityManager(2258): 23% 4308/Signal Catcher: 13% user + 9.6% kernel07-26 17:51:04.442: E/ActivityManager(2258): 5.7% 2258/system_server: 1.9% user + 3.8% kernel / faults: 6 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.9% 2346/ActivityManager: 1.9% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.9% 3244/SensorService: 0% user + 1.9% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.4% 35/kworker/u17:0: 0% user + 1.4% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.6% 4215/VosMCThread: 0% user + 1.6% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.9% 4618/com.baidu.map.location: 1.9% user + 0% kernel / faults: 823 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.9% 4647/GCDaemon: 1.9% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): +0% 28983/Thread-37982: 0% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.7% 28524/kworker/u16:3: 0% user + 1.7% kernel07-26 17:51:04.442: E/ActivityManager(2258): 1.7% 28560/com.pafinancialtech.zijinbank: 1.7% user + 0% kernel / faults: 7 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.8% 28690/com.pafinancialtech.zijinbank:remote: 1.8% user + 0% kernel / faults: 274 minor07-26 17:51:04.442: E/ActivityManager(2258): 1.8% 28723/ServiceStartArg: 1.8% user + 0% kernel07-26 17:51:04.442: E/ActivityManager(2258): 0.1% TOTAL: 0% user + 0% kernel + 0.1% iowait
从上面的堆栈信息,我们搜索关键字anr,可以找到
07-26 17:51:04.442: E/ActivityManager(2258): ANR in com.pafinancialtech.zijinbank (com.pafinancialtech.zijinbank/com.pingan.fstandard.paff.activity.MainActivity)
解读:在07-26 17点51分04秒442毫秒的时候 ActivityManager (进程号为2258) 发生了如下错误:com.pafinancialtech.zijinbank 包下面的MainActivity 无响应
第三行
Reason: Input dispatching timed out (Waiting to send non-key event because the touched window has not finished processing certain input events that were delivered to it over 500.0ms ago. Wait queue length: 11. Wait queue head age: 5573.8ms.)
解读:keyDispatchingTimeOut - 按键分配超时
第四行
07-26 17:51:04.442: E/ActivityManager(2258): Load: 5.15 / 3.68 / 3.38
解读:5分钟,10分钟,15分钟内的平均负载分别为:5.15 , 3.68 , 3.38
第五行
07-26 17:51:04.442: E/ActivityManager(2258): CPU usage from 602ms to -6091ms ago:
解读:cpu从602毫秒到6091毫秒之间的使用情况,也就是后面输出的一堆log信息。
至此,我们可以了解大概的原因了,程序肯定有地方做了耗时操作,但是具体在哪部分代码出现的问题呢,继续从堆栈信息查找,然并未找到突破口。既然堆栈信息没有可用信息了,我们再分析虚拟机信息 ,找到从手机pull出来的anr文件, 搜索“Dalvik Thread”关键词,快速定位到本应用程序的虚拟机信息日志,如下:
DALVIK THREADS (91):"main" prio=5 tid=1 Native | group="main" sCount=1 dsCount=0 obj=0x745bd000 self=0xab1951a8 | sysTid=24311 nice=-6 cgrp=apps sched=0/0 handle=0xf716dec8 | state=S schedstat=( 4172755721 1784708550 9755 ) utm=340 stm=77 core=0 HZ=100 | stack=0xff6cf000-0xff6d1000 stackSize=8MB | held mutexes= kernel: __switch_to+0x70/0x7c kernel: binder_thread_read+0x464/0xe8c kernel: binder_ioctl+0x3f8/0x824 kernel: compat_sys_ioctl+0x10c/0x11fc kernel: cpu_switch_to+0x48/0x4c native: #00 pc 00039cc4 /system/lib/libc.so (__ioctl+8) native: #01 pc 0003f4bd /system/lib/libc.so (ioctl+14) native: #02 pc 0001ccd3 /system/lib/libbinder.so (android::IPCThreadState::talkWithDriver(bool)+190) native: #03 pc 0001d347 /system/lib/libbinder.so (android::IPCThreadState::waitForResponse(android::Parcel*, int*)+42) native: #04 pc 0001d509 /system/lib/libbinder.so (android::IPCThreadState::transact(int, unsigned int, android::Parcel const&, android::Parcel*, unsigned int)+124) native: #05 pc 0001872b /system/lib/libbinder.so (android::BpBinder::transact(unsigned int, android::Parcel const&, android::Parcel*, unsigned int)+30) native: #06 pc 00082b53 /system/lib/libandroid_runtime.so (???) native: #07 pc 00a80891 /system/framework/arm/boot.oat (Java_android_os_BinderProxy_transactNative__ILandroid_os_Parcel_2Landroid_os_Parcel_2I+144) at android.os.BinderProxy.transactNative(Native method) at android.os.BinderProxy.transact(Binder.java:496) at android.location.ILocationManager$Stub$Proxy.getFromLocation(ILocationManager.java:777) at android.location.Geocoder.getFromLocation(Geocoder.java:133) at bonree.com.bonree.agent.android.harvest.f.a(unavailable:-1) at bonree.com.bonree.agent.android.harvest.j.onLocationChanged(unavailable:-1) at android.location.LocationManager$ListenerTransport._handleMessage(LocationManager.java:281) at android.location.LocationManager$ListenerTransport.access$000(LocationManager.java:210) at android.location.LocationManager$ListenerTransport$1.handleMessage(LocationManager.java:226) at android.os.Handler.dispatchMessage(Handler.java:102) at android.os.Looper.loop(Looper.java:135) at android.app.ActivityThread.main(ActivityThread.java:5305) at java.lang.reflect.Method.invoke!(Native method) at java.lang.reflect.Method.invoke(Method.java:372) at com.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:922) at com.android.internal.os.ZygoteInit.main(ZygoteInit.java:717)"Heap thread pool worker thread 0" prio=5 tid=2 Native (still starting up) | group="" sCount=1 dsCount=0 obj=0x0 self=0xab1bffd0 | sysTid=24316 nice=0 cgrp=apps sched=0/0 handle=0xab3f9a28 | state=S schedstat=( 7943593 7965519 46 ) utm=0 stm=0 core=4 HZ=100 | stack=0xf4ca7000-0xf4ca9000 stackSize=1020KB | held mutexes= kernel: __switch_to+0x70/0x7c kernel: futex_wait_queue_me+0xd4/0x12c kernel: futex_wait+0xe0/0x1d4 kernel: do_futex+0xc8/0x8d0 kernel: compat_SyS_futex+0xd0/0x14c kernel: cpu_switch_to+0x48/0x4c native: #00 pc 0000f618 /system/lib/libc.so (syscall+28) native: #01 pc 000a9487 /system/lib/libart.so (art::ConditionVariable::Wait(art::Thread*)+82) native: #02 pc 00230897 /system/lib/libart.so (art::ThreadPool::GetTask(art::Thread*)+50) native: #03 pc 0023083d /system/lib/libart.so (art::ThreadPoolWorker::Run()+52) native: #04 pc 00231079 /system/lib/libart.so (art::ThreadPoolWorker::Callback(void*)+52) native: #05 pc 00012f1b /system/lib/libc.so (__pthread_start(void*)+30) native: #06 pc 00010fdf /system/lib/libc.so (__start_thread+6) (no managed stack frames)"Heap thread pool worker thread 1" prio=5 tid=3 Native (still starting up) | group="" sCount=1 dsCount=0 obj=0x0 self=0xab19bb18 | sysTid=24317 nice=0 cgrp=apps sched=0/0 handle=0xab19ae08 | state=S schedstat=( 7940266 9912290 36 ) utm=0 stm=0 core=4 HZ=100 | stack=0xf4ba5000-0xf4ba7000 stackSize=1020KB | held mutexes= kernel: __switch_to+0x70/0x7c kernel: futex_wait_queue_me+0xd4/0x12c kernel: futex_wait+0xe0/0x1d4 kernel: do_futex+0xc8/0x8d0 kernel: compat_SyS_futex+0xd0/0x14c kernel: cpu_switch_to+0x48/0x4c native: #00 pc 0000f618 /system/lib/libc.so (syscall+28) native: #01 pc 000a9487 /system/lib/libart.so (art::ConditionVariable::Wait(art::Thread*)+82) native: #02 pc 00230897 /system/lib/libart.so (art::ThreadPool::GetTask(art::Thread*)+50) native: #03 pc 0023083d /system/lib/libart.so (art::ThreadPoolWorker::Run()+52) native: #04 pc 00231079 /system/lib/libart.so (art::ThreadPoolWorker::Callback(void*)+52) native: #05 pc 00012f1b /system/lib/libc.so (__pthread_start(void*)+30) native: #06 pc 00010fdf /system/lib/libc.so (__start_thread+6) (no managed stack frames)"Heap thread pool worker thread 2" prio=5 tid=4 Native (still starting up) | group="" sCount=1 dsCount=0 obj=0x0 self=0xab3530a8 | sysTid=24318 nice=0 cgrp=apps sched=0/0 handle=0xab3539c0 | state=S schedstat=( 4911303 7280680 38 ) utm=0 stm=0 core=5 HZ=100 | stack=0xf4aa3000-0xf4aa5000 stackSize=1020KB | held mutexes= kernel: __switch_to+0x70/0x7c kernel: futex_wait_queue_me+0xd4/0x12c kernel: futex_wait+0xe0/0x1d4 kernel: do_futex+0xc8/0x8d0 kernel: compat_SyS_futex+0xd0/0x14c kernel: cpu_switch_to+0x48/0x4c native: #00 pc 0000f618 /system/lib/libc.so (syscall+28) native: #01 pc 000a9487 /system/lib/libart.so (art::ConditionVariable::Wait(art::Thread*)+82) native: #02 pc 00230897 /system/lib/libart.so (art::ThreadPool::GetTask(art::Thread*)+50) native: #03 pc 0023083d /system/lib/libart.so (art::ThreadPoolWorker::Run()+52) native: #04 pc 00231079 /system/lib/libart.so (art::ThreadPoolWorker::Callback(void*)+52) native: #05 pc 00012f1b /system/lib/libc.so (__pthread_start(void*)+30) native: #06 pc 00010fdf /system/lib/libc.so (__start_thread+6) (no managed stack frames)
日志较多,截取一部分,因为我们要分析的是main线程,所以丢掉部分线程代码并不影响我们分析。
每一段都是一个线程,当然我们还是看main线程。通过分析发现突破口在这里:native: #06 pc 00082b53 /system/lib/libandroid_runtime.so (???) native: #07 pc 00a80891 /system/framework/arm/boot.oat (Java_android_os_BinderProxy_transactNative__ILandroid_os_Parcel_2Landroid_os_Parcel_2I+144) at android.os.BinderProxy.transactNative(Native method) at android.os.BinderProxy.transact(Binder.java:496) at android.location.ILocationManager$Stub$Proxy.getFromLocation(ILocationManager.java:777) at android.location.Geocoder.getFromLocation(Geocoder.java:133) at bonree.com.bonree.agent.android.harvest.f.a(unavailable:-1) at bonree.com.bonree.agent.android.harvest.j.onLocationChanged(unavailable:-1) at android.location.LocationManager$ListenerTransport._handleMessage(LocationManager.java:281) at android.location.LocationManager$ListenerTransport.access$000(LocationManager.java:210) at android.location.LocationManager$ListenerTransport$1.handleMessage(LocationManager.java:226) at android.os.Handler.dispatchMessage(Handler.java:102) at android.os.Looper.loop(Looper.java:135) at android.app.ActivityThread.main(ActivityThread.java:5305) at java.lang.reflect.Method.invoke!(Native method) at java.lang.reflect.Method.invoke(Method.java:372) at com.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:922) at com.android.internal.os.ZygoteInit.main(ZygoteInit.java:717)
可以看到程序调用了Geocoder类的getFromLocation方法,难道系统api也有bug?带着疑问,我们看下Geocoder的源码,如下:
ff39318b51641df88fbf90c468222142.png
从官方文档的介绍来看,getFromLocation方法需要通过网络查找来获取结果,最好在主线程之外的线程来调用这个方法。所以,最可能的原因就是我们程序在主线程调用了这个方法,然后移步到程序调用的地方,发现如下代码
public final void a(Location var1) { if(var1 == null) { this.p.d("Location must not be null"); } else { Geocoder var2 = new Geocoder(this.q); List var3 = null; try { var3 = var2.getFromLocation(var1.getLatitude(), var1.getLongitude(), 1); } catch (IOException var4) { this.p.d("Unable to geocode location: " + var4.toString()); } if(var3 != null && var3.size() != 0) { Address var5; if((var5 = (Address)var3.get(0)) != null) { String var7 = var5.getCountryCode(); String var6 = var5.getAdminArea(); if(var7 != null && var6 != null) { if(var7 == null || var6 == null) { this.p.d("Country code and administrative region are required."); } this.l(); } } } } }
原来确实是程序在主线程调用了此方法,导致主线程被阻塞,发生的ANR.