25 Sep, 2012

1 commit

  • Sandy bridge EDAC is calculating the memory size with overflow.
    Basically, the size field and the integer calculation is using 32 bits.
    More bits are needed, when the DIMM memories have high density.

    The net result is that memories are improperly reported there, when
    high-density DIMMs are used:

    EDAC DEBUG: in drivers/edac/sb_edac.c, line at 591: mc#0: channel 0, dimm 0, -16384 Mb (-4194304 pages) bank: 8, rank: 2, row: 0x10000, col: 0x800
    EDAC DEBUG: in drivers/edac/sb_edac.c, line at 591: mc#0: channel 1, dimm 0, -16384 Mb (-4194304 pages) bank: 8, rank: 2, row: 0x10000, col: 0x800

    As the number of pages value is handled at the EDAC core as unsigned
    ints, the driver shows the 16 GB memories at sysfs interface as 16760832
    MB! The fix is simple: calculate the number of pages as unsigned 64-bits
    integer.

    After the patch, the memory size (16 GB) is properly detected:

    EDAC DEBUG: in drivers/edac/sb_edac.c, line at 592: mc#0: channel 0, dimm 0, 16384 Mb (4194304 pages) bank: 8, rank: 2, row: 0x10000, col: 0x800
    EDAC DEBUG: in drivers/edac/sb_edac.c, line at 592: mc#0: channel 1, dimm 0, 16384 Mb (4194304 pages) bank: 8, rank: 2, row: 0x10000, col: 0x800

    Cc: stable@kernel.org
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     

30 Jul, 2012

1 commit

  • * devel: (33 commits)
    edac i5000, i5400: fix pointer math in i5000_get_mc_regs()
    edac: allow specifying the error count with fake_inject
    edac: add support for Calxeda highbank L2 cache ecc
    edac: add support for Calxeda highbank memory controller
    edac: create top-level debugfs directory
    sb_edac: properly handle error count
    i7core_edac: properly handle error count
    edac: edac_mc_handle_error(): add an error_count parameter
    edac: remove arch-specific parameter for the error handler
    amd64_edac: Don't pass driver name as an error parameter
    edac_mc: check for allocation failure in edac_mc_alloc()
    edac: Increase version to 3.0.0
    edac_mc: Cleanup per-dimm_info debug messages
    edac: Convert debugfX to edac_dbg(X,
    edac: Use more normal debugging macro style
    edac: Don't add __func__ or __FILE__ for debugf[0-9] msgs
    Edac: Add ABI Documentation for the new device nodes
    edac: move documentation ABI to ABI/testing/sysfs-devices-edac
    i7core_edac: change the mem allocation scheme to make Documentation/kobject.txt happy
    edac: change the mem allocation scheme to make Documentation/kobject.txt happy
    ...

    Mauro Carvalho Chehab
     

12 Jun, 2012

5 commits

  • Instead of reporting the error count via driver-specific details,
    use the new way provided by edac_mc_handle_error.

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • In order to avoid loosing error events, it is desirable to group
    error events together and generate a single trace for several identical
    errors.

    The trace API already allows reporting multiple errors. Change the
    handle_error function to also allow that.

    The changes at the drivers were made by this small script:

    $file .=$_ while (<>);
    $file =~ s/(edac_mc_handle_error)\s*\(([^\,]+)\,([^\,]+)\,/$1($2,$3, 1,/g;
    print $file;

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • Remove the arch-dependent parameter, as it were not used,
    as the MCE tracepoint weren't implemented. It probably doesn't
    make sense to have an MCE-specific tracepoint, as this will
    cost more bytes at the tracepoint, and tracepoint is not free.

    The changes at the EDAC drivers were done by this small perl script:

    $file .=$_ while (<>);
    $file =~ s/(edac_mc_handle_error)\s*\(([^\;]+)\,([^\,\)]+)\s*\)/$1($2)/g;
    print $file;

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • Use a more common debugging style.

    Remove __FILE__ uses, add missing newlines,
    coalesce formats and align arguments.

    Signed-off-by: Joe Perches
    Signed-off-by: Mauro Carvalho Chehab

    Joe Perches
     
  • The debug macro already adds that. Most of the work here was
    made by this small script:

    $f .=$_ while (<>);

    $f =~ s/(debugf[0-9]\s*\(\s*)__FILE__\s*": /\1"/g;
    $f =~ s/(debugf[0-9]\s*\(\s*)__FILE__\s*/\1/g;
    $f =~ s/(debugf[0-9]\s*\(\s*)__FILE__\s*"MC: /\1"/g;

    $f =~ s/(debugf[0-9]\s*\(\")\%s[\:\,\(\)]*\s*([^\"]*\s*[^\)]+)__func__\s*\,\s*/\1\2/g;
    $f =~ s/(debugf[0-9]\s*\(\")\%s[\:\,\(\)]*\s*([^\"]*\s*[^\)]+),\s*__func__\s*\)/\1\2)/g;
    $f =~ s/(debugf[0-9]\s*\(\"MC\:\s*)\%s[\:\,\(\)]*\s*([^\"]*\s*[^\)]+)__func__\s*\,\s*/\1\2/g;
    $f =~ s/(debugf[0-9]\s*\(\"MC\:\s*)\%s[\:\,\(\)]*\s*([^\"]*\s*[^\)]+),\s*__func__\s*\)/\1\2)/g;

    $f =~ s/\"MC\: \\n\"/"MC:\\n"/g;

    print $f;

    After running the script, manual cleanups were done to fix it the remaining
    places.

    While here, removed the __LINE__ on most places, as it doesn't actually give
    useful info on most places.

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     

11 Jun, 2012

3 commits

  • As EDAC doesn't use struct device itself, it created a parent dev
    pointer called as "pdev". Now that we'll be converting it to use
    struct device, instead of struct devsys, this needs to be fixed.

    No functional changes.

    Reviewed-by: Aristeu Rozanski
    Acked-by: Chris Metcalf
    Cc: Doug Thompson
    Cc: Borislav Petkov
    Cc: Mark Gross
    Cc: Jason Uhlenkott
    Cc: Tim Small
    Cc: Ranganathan Desikan
    Cc: "Arvind R."
    Cc: Olof Johansson
    Cc: Egor Martovetsky
    Cc: Michal Marek
    Cc: Jiri Kosina
    Cc: Joe Perches
    Cc: Dmitry Eremin-Solenikov
    Cc: Benjamin Herrenschmidt
    Cc: Hitoshi Mitake
    Cc: Andrew Morton
    Cc: "Niklas Söderlund"
    Cc: Shaohui Xie
    Cc: Josh Boyer
    Cc: linuxppc-dev@lists.ozlabs.org
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • On SandyBridge, DDRIOA(Dev: 17 Func: 0 Offset: 328) is used
    to detect whether DIMM is RDIMM/LRDIMM, not TA(Dev: 15 Func: 0).

    Signed-off-by: Chen Gong
    Signed-off-by: Mauro Carvalho Chehab

    Chen Gong
     
  • Some edac drivers register themselves as mce decoders via
    notifier_chain. But in current notifier_chain implementation logic,
    it doesn't accept same notifier registered twice. If so, it will be
    wrong when adding/removing the element from the list. For example,
    on one SandyBridge platform, remove module sb_edac and then trigger
    one error, it will hit oops because it has no mce decoder registered
    but related notifier_chain still points to an invalid callback
    function. Here is an example:

    Call Trace:
    [] atomic_notifier_call_chain+0x1a/0x20
    [] mce_log+0x46/0x180
    [] apei_mce_report_mem_error+0x4a/0x60
    [] ghes_do_proc+0x192/0x210
    [] ghes_proc+0x46/0x70
    [] ghes_notify_sci+0x48/0x80
    [] notifier_call_chain+0x55/0x80
    [] __blocking_notifier_call_chain+0x5a/0x80
    [] ? acpi_os_wait_events_complete+0x23/0x23
    [] blocking_notifier_call_chain+0x16/0x20
    [] acpi_hed_notify+0x19/0x1b
    [] acpi_device_notify+0x19/0x1b
    [] acpi_ev_notify_dispatch+0x67/0x7f
    [] acpi_os_execute_deferred+0x29/0x36
    [] process_one_work+0x132/0x450
    [] worker_thread+0x17b/0x3c0
    [] ? manage_workers+0x120/0x120
    [] kthread+0x9e/0xb0
    [] kernel_thread_helper+0x4/0x10
    [] ? kthread_freezable_should_stop+0x70/0x70
    [] ? gs_change+0x13/0x13
    Code: f3 49 89 d4 45 85 ed 4d 89 c6 48 8b 0f 74 48 48 85 c9 75 17 eb 41
    0f 1f 80 00 00 00 00 41 83 ed 01 4c 89 f9 74 22 4d 85 ff 74 1d 8b
    79 08 4c 89 e2 48 89 de 48 89 cf ff 11 4d 85 f6 74 04 41
    RIP [] notifier_call_chain+0x46/0x80
    RSP
    CR2: ffffffffa01af838
    ---[ end trace 0100930068e73e6f ]---
    BUG: unable to handle kernel paging request at fffffffffffffff8
    IP: [] kthread_data+0x10/0x20
    PGD 1a0d067 PUD 1a0e067 PMD 0
    Oops: 0000 [#2] SMP

    Only i7core_edac and sb_edac have such issues because they have more
    than one memory controller which means they have to register mce
    decoder many times.

    Cc: # 3.2 and upper
    Signed-off-by: Chen Gong
    Signed-off-by: Mauro Carvalho Chehab

    Chen Gong
     

30 May, 2012

1 commit

  • Pull EDAC internal API changes from Mauro Carvalho Chehab:
    "This changeset is the first part of a series of patches that fixes the
    EDAC sybsystem. On this set, it changes the Kernel EDAC API in order
    to properly represent the Intel i3/i5/i7, Xeon 3xxx/5xxx/7xxx, and
    Intel E5-xxxx memory controllers.

    The EDAC core used to assume that:

    - the DRAM chip select pin is directly accessed by the memory
    controller

    - when multiple channels are used, they're all filled with the
    same type of memory.

    None of the above premises is true on Intel memory controllers since
    2002, when RAMBUS and FB-DIMMs were introduced, and Advanced Memory
    Buffer or by some similar technologies hides the direct access to the
    DRAM pins.

    So, the existing drivers for those chipsets had to lie to the EDAC
    core, in general telling that just one channel is filled. That
    produces some hard to understand error messages like:

    EDAC MC0: CE row 3, channel 0, label "DIMM1": 1 Unknown error(s): memory read error on FATAL area : cpu=0 Err=0008:00c2 (ch=2), addr = 0xad1f73480 => socket=0, Channel=0(mask=2), rank=1

    The location information there (row3 channel 0) is completely bogus:
    it has no physical meaning, and are just some random values that the
    driver uses to talk with the EDAC core. The error actually happened
    at CPU socket 0, channel 0, slot 1, but this is not reported anywhere,
    as the EDAC core doesn't know anything about the memory layout. So,
    only advanced users that know how the EDAC driver works and that tests
    their systems to see how DIMMs are mapped can actually benefit for
    such error logs.

    This patch series fixes the error report logic, in order to allow the
    EDAC to expose the memory architecture used by them to the EDAC core.
    So, as the EDAC core now understands how the memory is organized, it
    can provide an useful report:

    EDAC MC0: CE memory read error on DIMM1 (channel:0 slot:1 page:0x364b1b offset:0x600 grain:32 syndrome:0x0 - count:1 area:DRAM err_code:0001:0090 socket:0 channel_mask:1 rank:4)

    The location of the DIMM where the error happened is reported by "MC0"
    (cpu socket #0), at "channel:0 slot:1" location, and matches the
    physical location of the DIMM.

    There are two remaining issues not covered by this patch series:

    - The EDAC sysfs API will still report bogus values. So,
    userspace tools like edac-utils will still use the bogus data;

    - Add a new tracepoint-based way to get the binary information
    about the errors.

    Those are on a second series of patches (also at -next), but will
    probably miss the train for 3.5, due to the slow review process."

    Fix up trivial conflict (due to spelling correction of removed code) in
    drivers/edac/edac_device.c

    * git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-edac: (42 commits)
    i7core: fix ranks information at the per-channel struct
    i5000: Fix the fatal error handling
    i5100_edac: Fix a warning when compiled with 32 bits
    i82975x_edac: Test nr_pages earlier to save a few CPU cycles
    e752x_edac: provide more info about how DIMMS/ranks are mapped
    i5000_edac: Fix the logic that retrieves memory information
    i5400_edac: improve debug messages to better represent the filled memory
    edac: Cleanup the logs for i7core and sb edac drivers
    edac: Initialize the dimm label with the known information
    edac: Remove the legacy EDAC ABI
    x38_edac: convert driver to use the new edac ABI
    tile_edac: convert driver to use the new edac ABI
    sb_edac: convert driver to use the new edac ABI
    r82600_edac: convert driver to use the new edac ABI
    ppc4xx_edac: convert driver to use the new edac ABI
    pasemi_edac: convert driver to use the new edac ABI
    mv64x60_edac: convert driver to use the new edac ABI
    mpc85xx_edac: convert driver to use the new edac ABI
    i82975x_edac: convert driver to use the new edac ABI
    i82875p_edac: convert driver to use the new edac ABI
    ...

    Linus Torvalds
     

29 May, 2012

7 commits

  • Remove some information that it is duplicated at the MCE log,
    and don't have much usage for the error. Those data will be
    added again, when creating a trace function that outputs both
    memory errors and MCE fields.

    Cc: Aristeu Rozanski
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • Now that all drivers got converted to use the new ABI, we can
    drop the old one.

    Acked-by: Chris Metcalf
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • The legacy edac ABI is going to be removed. Port the driver to use
    and benefit from the new API functionality.

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • The number of pages is a dimm property. Move it to the dimm struct.

    After this change, it is possible to add sysfs nodes for the DIMM's that
    will properly represent the DIMM stick properties, including its size.

    A TODO fix here is to properly represent dual-rank/quad-rank DIMMs when
    the memory controller represents the memory via chip select rows.

    Reviewed-by: Aristeu Rozanski
    Acked-by: Borislav Petkov
    Acked-by: Chris Metcalf
    Cc: Doug Thompson
    Cc: Mark Gross
    Cc: Jason Uhlenkott
    Cc: Tim Small
    Cc: Ranganathan Desikan
    Cc: "Arvind R."
    Cc: Olof Johansson
    Cc: Egor Martovetsky
    Cc: Michal Marek
    Cc: Jiri Kosina
    Cc: Joe Perches
    Cc: Dmitry Eremin-Solenikov
    Cc: Benjamin Herrenschmidt
    Cc: Hitoshi Mitake
    Cc: Andrew Morton
    Cc: "Niklas Söderlund"
    Cc: Shaohui Xie
    Cc: Josh Boyer
    Cc: linuxppc-dev@lists.ozlabs.org
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • Almost all edac drivers initialize csrow_info->first_page,
    csrow_info->last_page and csrow_info->page_mask. Those vars are
    used inside the EDAC core, in order to calculate the csrow affected
    by an error, by using the routine edac_mc_find_csrow_by_page().

    However, very few drivers actually use it:
    e752x_edac.c
    e7xxx_edac.c
    i3000_edac.c
    i82443bxgx_edac.c
    i82860_edac.c
    i82875p_edac.c
    i82975x_edac.c
    r82600_edac.c

    There also a few other drivers that have their own calculus
    formula internally using those vars.

    All the others are just wasting time by initializing those
    data.

    While initializing data without using them won't cause any troubles, as
    those information is stored at the wrong place (at csrows structure), it
    is better to remove what is unused, in order to simplify the next patch.

    Reviewed-by: Aristeu Rozanski
    Acked-by: Borislav Petkov
    Acked-by: Chris Metcalf
    Cc: Doug Thompson
    Cc: Hitoshi Mitake
    Cc: Andrew Morton
    Cc: "Niklas Söderlund"
    Cc: Josh Boyer
    Cc: Jiri Kosina
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • On systems based on chip select rows, all channels need to use memories
    with the same properties, otherwise the memories on channels A and B
    won't be recognized.

    However, such assumption is not true for all types of memory
    controllers.

    Controllers for FB-DIMM's don't have such requirements.

    Also, modern Intel controllers seem to be capable of handling such
    differences.

    So, we need to get rid of storing the DIMM information into a per-csrow
    data, storing it, instead at the right place.

    The first step is to move grain, mtype, dtype and edac_mode to the
    per-dimm struct.

    Reviewed-by: Aristeu Rozanski
    Reviewed-by: Borislav Petkov
    Acked-by: Chris Metcalf
    Cc: Doug Thompson
    Cc: Borislav Petkov
    Cc: Mark Gross
    Cc: Jason Uhlenkott
    Cc: Tim Small
    Cc: Ranganathan Desikan
    Cc: "Arvind R."
    Cc: Olof Johansson
    Cc: Egor Martovetsky
    Cc: Michal Marek
    Cc: Jiri Kosina
    Cc: Joe Perches
    Cc: Dmitry Eremin-Solenikov
    Cc: Benjamin Herrenschmidt
    Cc: Hitoshi Mitake
    Cc: Andrew Morton
    Cc: James Bottomley
    Cc: "Niklas Söderlund"
    Cc: Shaohui Xie
    Cc: Josh Boyer
    Cc: Mike Williams
    Cc: linuxppc-dev@lists.ozlabs.org
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • The way a DIMM is currently represented implies that they're
    linked into a per-csrow struct. However, some drivers don't see
    csrows, as they're ridden behind some chip like the AMB's
    on FBDIMM's, for example.

    This forced drivers to fake^Wvirtualize a csrow struct, and to create
    a mess under csrow/channel original's concept.

    Move the DIMM labels into a per-DIMM struct, and add there
    the real location of the socket, in terms of csrow/channel.
    Latter patches will modify the location to properly represent the
    memory architecture.

    All other drivers will use a per-csrow type of location.
    Some of those drivers will require a latter conversion, as
    they also fake the csrows internally.

    TODO: While this patch doesn't change the existing behavior, on
    csrows-based memory controllers, a csrow/channel pair points to a memory
    rank. There's a known bug at the EDAC core that allows having different
    labels for the same DIMM, if it has more than one rank. A latter patch
    is need to merge the several ranks for a DIMM into the same dimm_info
    struct, in order to avoid having different labels for the same DIMM.

    The edac_mc_alloc() will now contain a per-dimm initialization loop that
    will be changed by latter patches in order to match other types of
    memory architectures.

    Reviewed-by: Aristeu Rozanski
    Reviewed-by: Borislav Petkov
    Cc: Doug Thompson
    Cc: Ranganathan Desikan
    Cc: "Arvind R."
    Cc: "Niklas Söderlund"
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     

30 Apr, 2012

1 commit


29 Mar, 2012

1 commit

  • Pull EDAC fixes from Mauro Carvalho Chehab:
    "A series of EDAC driver fixes. It also has one core fix at the
    documentation, and a rename patch, fixing the name of the struct that
    contains the rank information."

    * 'linux_next' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-edac:
    edac: rename channel_info to rank_info
    i5400_edac: Avoid calling pci_put_device() twice
    edac: i5100 ack error detection register after each read
    edac: i5100 fix erroneous define for M1Err
    edac: sb_edac: Fix a wrong value setting for the previous value
    edac: sb_edac: Fix a INTERLEAVE_MODE() misuse
    edac: sb_edac: Let the driver depend on PCI_MMCONFIG
    edac: Improve the comments to better describe the memory concepts
    edac/ppc4xx_edac: Fix compilation
    Fix sb_edac compilation with 32 bits kernels

    Linus Torvalds
     

22 Mar, 2012

3 commits

  • >From the driver design, the variable limit wants to compare with its
    previous value, we should set the value of limit instead of the value
    of tmp_mb to the variable prev.

    Signed-off-by: Hui Wang
    Signed-off-by: Mauro Carvalho Chehab

    Hui Wang
     
  • We can identify dram interleave mode from the Dram Rule register
    rather than Dram Interleave list register.

    In this context, the reg of INTERLEAVE_MODE(reg) contains the Dram
    Interleave list register, we can't get interleave mode from the reg,
    while the variable interleave_mode saves the the mode got from the
    Dram Rule register, so we use the variable to replace
    INTERLEAVE_MDDE(reg) here.

    Signed-off-by: Hui Wang
    Signed-off-by: Mauro Carvalho Chehab

    Hui Wang
     
  • As reported by Josh Boyer :
    > drivers/edac/sb_edac.c: In function 'get_memory_error_data':
    > drivers/edac/sb_edac.c:861:2: warning: left shift count >= width of type
    > [enabled by default]
    >
    > ERROR: "__udivdi3" [drivers/edac/sb_edac.ko] undefined!
    > make[1]: *** [__modpost] Error 1
    > make: *** [modules] Error 2

    PS.: compile-tested only

    Reported-by: Josh Boyer
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     

19 Mar, 2012

1 commit

  • These const tables are currently marked __devinitdata, but
    Documentation/PCI/pci.txt says:

    "o The ID table array should be marked __devinitconst; this is done
    automatically if the table is declared with DEFINE_PCI_DEVICE_TABLE()."

    So use DEFINE_PCI_DEVICE_TABLE(x).

    Based on PaX and earlier work by Andi Kleen.

    Signed-off-by: Lionel Debroux
    Signed-off-by: Borislav Petkov

    Lionel Debroux
     

07 Jan, 2012

1 commit


21 Dec, 2011

1 commit

  • Several fields in struct cpuinfo_x86 were not defined for the
    !SMP case, likely to save space. However, those fields still
    have some meaning for UP, and keeping them allows some #ifdef
    removal from other files. The additional size of the UP kernel
    from this change is not significant enough to worry about
    keeping up the distinction:

    text data bss dec hex filename
    4737168 506459 972040 6215667 5ed7f3 vmlinux.o.before
    4737444 506459 972040 6215943 5ed907 vmlinux.o.after

    for a difference of 276 bytes for an example UP config.

    If someone wants those 276 bytes back badly then it should
    be implemented in a cleaner way.

    Signed-off-by: Kevin Winchester
    Cc: Steffen Persvold
    Link: http://lkml.kernel.org/r/1324428742-12498-1-git-send-email-kjwinchester@gmail.com
    Signed-off-by: Ingo Molnar

    Kevin Winchester
     

14 Dec, 2011

1 commit


01 Nov, 2011

3 commits

  • The edac driver for Sandy Bridge was found to be reporting "FPM"
    for edac_mode, which clearly doesn't make sense. It was found that
    sb_edac.c:get_dimm_config was reusing a variable for both mem_type
    and edac_type, and thus was overwriting the value after setting
    it correctly. This patch fixes that issue.

    Before the patch:
    /sys/devices/system/edac/mc/mc0/csrow0/edac_mode:FPM
    /sys/devices/system/edac/mc/mc0/csrow1/edac_mode:FPM
    /sys/devices/system/edac/mc/mc0/csrow2/edac_mode:FPM
    /sys/devices/system/edac/mc/mc0/csrow3/edac_mode:FPM

    After:
    /sys/devices/system/edac/mc/mc0/csrow0/edac_mode:S4ECD4ED
    /sys/devices/system/edac/mc/mc0/csrow1/edac_mode:S4ECD4ED
    /sys/devices/system/edac/mc/mc0/csrow2/edac_mode:S4ECD4ED
    /sys/devices/system/edac/mc/mc0/csrow3/edac_mode:S4ECD4ED

    Signed-off-by: Mark A. Grondona
    Signed-off-by: Mauro Carvalho Chehab

    Mark A. Grondona
     
  • Some changes on it were required due to changeset cd90cc84c6bf0, that
    changed the glue with the MCE logic.

    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab
     
  • This driver is known to work on mine and Tony's test environments,
    using software error injection, and a partial hardware/software
    error injection tool.

    There's no broader range test yet to double check if the error decoding
    logic will actually point to the right DIMM, so use it with care.
    More tests are required to be sure that the driver will work on all
    different types of memory configurations.

    If you're willing to risk using it, I suggest you to enable EDAC debugs
    for your test machines, as the debug logs helps to track what's going
    inside the driver.

    Please feed me with bug reports, if you notice that the driver
    is miss-behaving.

    Tested-by: Tony Luck
    Signed-off-by: Mauro Carvalho Chehab

    Mauro Carvalho Chehab