Viestialustana vianhallintajärjestelmät

Xorg log flooded with ”XKB: reuse xkmfile”

13. maaliskuuta 2012 klo 22.37
Sijainti: Vianhallintajärjestelmät: Launchpad

I initially posed this as Question #188982, but now decided to report it as it seems to persist and no definitive answers as to the cause were found. I was apprehensive about reporting at first mainly because this doesn’t seem to occur on my other (Precise-running) computers. It is however 100% reproducible on this one (i.e. the flood occurs on each creation of an Xorg log, though I don’t know what triggers individual lines).

My Xorg log is flooded with lines about reusing xkmfile:

jani@saegusa:~$ grep ”XKB: reuse xkmfile /var/lib/xkb/server-” /var/log/Xorg.0.log | tail
[ 391.894] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.901] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.906] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.909] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.914] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.919] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.922] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.926] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.929] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
[ 391.933] (II) XKB: reuse xkmfile /var/lib/xkb/server-5CBD5B10CEC815928ACEFB86BAB14051BA0C83FF.xkm
jani@saegusa:~$ grep ”XKB: reuse xkmfile /var/lib/xkb/server-” /var/log/Xorg.0.log | wc -l
1507

(Interestingly I now notice that the number of those lines seems to be constant: the above was copy+paste from terminal just now, and when I posed Question #188982 on 2012-02-27, the result was 1507 also.)

Vastaa viestiin sen kontekstissa (Launchpad)

Just to add that all the icons in Elements panel are expanded

13. maaliskuuta 2012 klo 20.42
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Chromium

Just to add that all the icons in Elements panel, not just arrowheads, are expanded, unlike text that ignores Page zoom. Other icons, ugly as they appear when zoomed, do seem to function though.

Vastaa viestiin sen kontekstissa (Launchpad)

Default zoom applied to Elements panel’s arrowheads, breaking their function

13. maaliskuuta 2012 klo 20.37
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Chromium

Steps to reproduce:
1. Start Chromium with new (temporary) profile
2. Go to chrome://settings/advanced
3. Set Page zoom to 500%
4. For any page with content, open the Elements panel
5. Try to expand/shrink elements

What happens:
With the expanding/shrinking arrowheads zoomed 500%, but with code still at default zoom, it’s extremely difficult to hit the arrowhead in the right spot to expand/shrink elements. Depending on the selected Page zoom level the actual hotspot may even fall completely outside the arrowhead graphic (to the left from it).

What I expect to happen:
For the arrowheads and the code to both follow the Page zoom setting or ignore it, and either way, for the arrowhead graphic to function as the actual hotspot for expanding/shrinking elements.

Vastaa viestiin sen kontekstissa (Launchpad)

It seems especially prone to occur when I have multiple Gnome terminal windows open

11. maaliskuuta 2012 klo 20.50
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Gnome, Unity

I still don’t have a surefire recipe for reproducing this, but it seems especially prone to occur when I have multiple Gnome terminal windows open, or one with multiple tabs in it, in addition to other apps. I have a gut feeling it’s triggered 4/5 times by switching from something else into the set of Gnome terminal windows (with the mouse, via launcher).

Vastaa viestiin sen kontekstissa (Launchpad)

I went back in Precise kernels

9. maaliskuuta 2012 klo 20.04
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Intel, Linux

I went back in Precise kernels [1] all the way back to 3.0.0-12.20. There seem to be no easy answers: now even 3.0.0-12.20 crashes with -intel.

I think this either means that the hardware’s broken, or that the issue has been lurking in kernels all the way back to (at least) 3.0.0-12.20, and was only triggered by some early Precise updates (during the time window I described above). As I said, it (definitely) wasn’t there when I filed Bug #903831 on 2011-12-13 (because I couldn’t have gotten far enough to trigger that bug with this on the way).

I’ll attach shots of current results with the early Precise kernels below just in case there’s anything useful there.

I think I’ll try ruling out hardware failure with Oneiric, either with the live disc (if that uses -intel) or by reinstalling.

* [1] https://launchpad.net/ubuntu/precise/+source/linux/

Vastaa viestiin sen kontekstissa (Launchpad)

V3.3-rc6 still crashes

9. maaliskuuta 2012 klo 19.01
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Intel, Linux

V3.3-rc6 still crashes, irregardless of RC6 being enabled/disabled.

Vastaa viestiin sen kontekstissa (Launchpad)

Confirming: the fix works.

7. maaliskuuta 2012 klo 21.40
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Totem

Confirming: the fix works.

Vastaa viestiin sen kontekstissa (Launchpad)

wistron_btns ”breaks” -pae: floods with ”Unknown key code 10”, causing severe slowdown

3. maaliskuuta 2012 klo 15.01
Sijainti: Vianhallintajärjestelmät: Kernel Bug Tracker
Avainsanat: Linux

My summary’s crap because this is difficult to summarize, hopefully the explanation below makes it clearer. I have little understanding of kernel internals, so I’ll first just try and describe the symptom as it appears.

I’ve come across an issue on my Fujitsu Siemens Amilo M7400 laptop with wistron_btns that is triggered by certain kernels, and once triggered, seems to affect all subsequent attempts to reboot with -pae kernels until a non-pae kernel is booted. I initially reported this on Launchpad [1].

I can currently trigger the issue by (cold or re-) booting 3.2.0-14-pae (these are Ubuntu’s packaged kernels) or by booting (for example) 3.3.0-030300rc4-generic-pae in recovery mode (= ”ro recovery nomodeset”). The recovery boot seems to work normally, but the 3.2.0-14-pae boot already exhibits the failure: it seemingly freezes. (More about the exact nature of ”failure” below.)

Once I’ve triggered the issue, rebooting with any -pae kernel fails similar to how 3.2.0-14-pae behaves irregardless of preceding boots.

I can ”fix” this by booting a non-pae kernel (which never fails). After that subsequent reboots with -pae kernels (apart from 3.2.0-14-pae) no longer fail — not until I do any of the triggering actions again.

Now, the ”failure” looks like a freeze, but it’s actually just an extreme slowdown. With patience, I can actually have the boot finish and can inspect logs. Dmesg reveals that wistron_btns is repeating ”Unknown key code 10” over and over.

If I comment wistron_btns out of /etc/modules so that it isn’t loaded, the issue goes away, meaning I can no longer trigger it.

As I said, I have little understanding of kernel bugs, so what I say next may be completely off, but the way I’ve interpreted this is that the ”brokenness” is actually hidden in the hardware, in something controlled by wistron_btns. Booting 3.2.0-14-pae/recovery booting any -pay puts the controller(?) in a ”broken” state from which a -pae kernel can’t recover, but a non-pae kernel can. And although -pae kernels later than 3.2.0-14 can’t recover a ”broken” controller, they also cannot put it into that ”broken” state (which is a good turn of development).

I’ll be happy to provide more info as requested. I’m attaching dmesg output for
starters.

* [1] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926012

Vastaa viestiin sen kontekstissa (Kernel Bug Tracker)

With persistent booting I was able to get a panic

3. maaliskuuta 2012 klo 11.25
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Intel, Linux

With persistent booting I was able to get a panic [1] showing with 3.3.0-030300rc4, and it looks the same as what the dmesg I posted in #29 [2] showed: print_bad_pte+0x187/0x1e0 is on top the Trace. Despite the numerous boots I was still unable to reproduce the initial printk+0x2d/0x2f, so it may be fixed in Main or masked by the print_bad_pte+0x187/0x1e0 (though this still is based only on two datapoints in a frustratingly random issue).

Whether RC6 is enabled or disabled doesn’t seem to have bearing on this. 3.2.0-17 produces printk+0x2d/0x2f either way [3], and 3.2.0-18.28 also panics, though less consistently: I was only able to produce a sure printk+0x2d/0x2f once [4], with 3.2.0-18.28 non-pae. Mostly the errors fail to reveal themselves, and when they do, they are different from printk+0x2d/0x2f but also from each other: a couple of times a warn_slowpath_common+0x72/0xa0 (as in Bug #917668, though the hardware and pointers are different) occurred [5], and once it was a Bad page map [6] in unity-greeter.

* [1] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/32
* [2] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/29
* [3] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/33
* [4] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/34
* [5] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/35
* [6] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/926007/comments/36

Vastaa viestiin sen kontekstissa (Launchpad)

I’ll upload a bunch of new screenshots for reference

3. maaliskuuta 2012 klo 11.20
Sijainti: Vianhallintajärjestelmät: Launchpad
Avainsanat: Intel, Linux

I’ll upload a bunch of new screenshots for reference. They’re all related to testing this so bear with me, I’ll explain them further after uploading.

Vastaa viestiin sen kontekstissa (Launchpad)

« Uudempia - Vanhempia »