16 Sep, 2009

1 commit


15 Sep, 2009

2 commits

  • * 'for-2.6.32' of git://git.kernel.dk/linux-2.6-block: (29 commits)
    block: use blkdev_issue_discard in blk_ioctl_discard
    Make DISCARD_BARRIER and DISCARD_NOBARRIER writes instead of reads
    block: don't assume device has a request list backing in nr_requests store
    block: Optimal I/O limit wrapper
    cfq: choose a new next_req when a request is dispatched
    Seperate read and write statistics of in_flight requests
    aoe: end barrier bios with EOPNOTSUPP
    block: trace bio queueing trial only when it occurs
    block: enable rq CPU completion affinity by default
    cfq: fix the log message after dispatched a request
    block: use printk_once
    cciss: memory leak in cciss_init_one()
    splice: update mtime and atime on files
    block: make blk_iopoll_prep_sched() follow normal 0/1 return convention
    cfq-iosched: get rid of must_alloc flag
    block: use interrupts disabled version of raise_softirq_irqoff()
    block: fix comment in blk-iopoll.c
    block: adjust default budget for blk-iopoll
    block: fix long lines in block/blk-iopoll.c
    block: add blk-iopoll, a NAPI like approach for block devices
    ...

    Linus Torvalds
     
  • * git://git.kernel.org/pub/scm/linux/kernel/git/jejb/scsi-misc-2.6: (209 commits)
    [SCSI] fix oops during scsi scanning
    [SCSI] libsrp: fix memory leak in srp_ring_free()
    [SCSI] libiscsi, bnx2i: make bound ep check common
    [SCSI] libiscsi: add completion function for drivers that do not need pdu processing
    [SCSI] scsi_dh_rdac: changes for rdac debug logging
    [SCSI] scsi_dh_rdac: changes to collect the rdac debug information during the initialization
    [SCSI] scsi_dh_rdac: move the init code from rdac_activate to rdac_bus_attach
    [SCSI] sg: fix oops in the error path in sg_build_indirect()
    [SCSI] mptsas : Bump version to 3.04.12
    [SCSI] mptsas : FW event thread and scsi mid layer deadlock in SYNCHRONIZE CACHE command
    [SCSI] mptsas : Send DID_NO_CONNECT for pending IOs of removed device
    [SCSI] mptsas : PAE Kernel more than 4 GB kernel panic
    [SCSI] mptsas : NULL pointer on big endian systems causing Expander not to tear off
    [SCSI] mptsas : Sanity check for phyinfo is added
    [SCSI] scsi_dh_rdac: Add support for Sun StorageTek ST2500, ST2510 and ST2530
    [SCSI] pmcraid: PMC-Sierra MaxRAID driver to support 6Gb/s SAS RAID controller
    [SCSI] qla2xxx: Update version number to 8.03.01-k6.
    [SCSI] qla2xxx: Properly delete rports attached to a vport.
    [SCSI] qla2xxx: Correct various NPIV issues.
    [SCSI] qla2xxx: Correct qla2x00_eh_wait_on_command() to wait correctly.
    ...

    Linus Torvalds
     

12 Sep, 2009

16 commits

  • Chris Webb reported:
    p0# uname -a
    Linux f7ea8425-d45b-490f-a738-d181d0df6963.host.elastichosts.com 2.6.30.4-elastic-lon-p #2 SMP PREEMPT Thu Aug 20 14:30:50 BST 2009 x86_64 Intel(R) Xeon(R) CPU E5420 @ 2.50GHz GenuineIntel GNU/Linux
    p0# zgrep SCAN_ASYNC /proc/config.gz
    # CONFIG_SCSI_SCAN_ASYNC is not set

    p0# cat /var/log/kern/2009-08-20
    [...]
    15:27:10.485 kernel: scsi9 : iSCSI Initiator over TCP/IP
    15:27:11.493 kernel: scsi 9:0:0:0: RAID IET Controller 0001 PQ: 0 ANSI: 5
    15:27:11.493 kernel: scsi 9:0:0:0: Attached scsi generic sg6 type 12
    15:27:11.495 kernel: scsi 9:0:0:1: Direct-Access IET VIRTUAL-DISK 0001 PQ: 0 ANSI: 5
    15:27:11.495 kernel: sd 9:0:0:1: Attached scsi generic sg7 type 0
    15:27:11.495 kernel: sd 9:0:0:1: [sdg] 4194304 512-byte hardware sectors: (2.14 GB/2.00 GiB)
    15:27:11.495 kernel: sd 9:0:0:1: [sdg] Write Protect is off
    15:27:11.495 kernel: sd 9:0:0:1: [sdg] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
    15:27:13.012 kernel: sdg:scsi 9:0:0:1: [sdg] Unhandled error code
    15:27:13.012 kernel: scsi 9:0:0:1: [sdg] Result: hostbyte=0x07 driverbyte=0x00
    15:27:13.012 kernel: end_request: I/O error, dev sdg, sector 0
    15:27:13.012 kernel: Buffer I/O error on device sdg, logical block 0
    15:27:13.012 kernel: ldm_validate_partition_table(): Disk read failed.
    15:27:13.012 kernel: unable to read partition table
    15:27:13.014 kernel: BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
    15:27:13.014 kernel: IP: [] disk_part_iter_next+0x74/0xfd
    15:27:13.014 kernel: PGD 82ad0b067 PUD 82cd7e067 PMD 0
    15:27:13.014 kernel: Oops: 0000 [#1] PREEMPT SMP
    15:27:13.014 kernel: last sysfs file: /sys/devices/platform/host9/session4/iscsi_session/session4/ifacename
    15:27:13.014 kernel: CPU 5
    15:27:13.014 kernel: Modules linked in:
    15:27:13.014 kernel: Pid: 13999, comm: async/0 Not tainted 2.6.30.4-elastic-lon-p #2 X7DBN
    15:27:13.014 kernel: RIP: 0010:[] [] disk_part_iter_next+0x74/0xfd
    15:27:13.014 kernel: RSP: 0018:ffff88066afa3dd0 EFLAGS: 00010246
    15:27:13.014 kernel: RAX: ffff88082b58a000 RBX: ffff88066afa3e00 RCX: 0000000000000000
    15:27:13.014 kernel: RDX: 0000000000000000 RSI: ffff88082b58a000 RDI: 0000000000000000
    15:27:13.014 kernel: RBP: ffff88066afa3df0 R08: ffff88066afa2000 R09: ffff8806a204f000
    15:27:13.014 kernel: R10: 000000fb12c7d274 R11: ffff8806c2bf0628 R12: ffff88066afa3e00
    15:27:13.014 kernel: R13: ffff88082c829a00 R14: 0000000000000000 R15: ffff8806bc50c920
    15:27:13.014 kernel: FS: 0000000000000000(0000) GS:ffff88002818a000(0000) knlGS:0000000000000000
    15:27:13.014 kernel: CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
    15:27:13.014 kernel: CR2: 0000000000000010 CR3: 000000082ade3000 CR4: 00000000000426e0
    15:27:13.014 kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    15:27:13.014 kernel: DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
    15:27:13.014 kernel: Process async/0 (pid: 13999, threadinfo ffff88066afa2000, task ffff8806c2bf05e0)
    15:27:13.014 kernel: Stack:
    15:27:13.014 kernel: 0000000000000000 ffff88066afa3e00 ffff88066afa3e00 ffff88082c829a00
    15:27:13.014 kernel: ffff88066afa3e40 ffffffff80306feb ffff88082b58a000 0000000000000000
    15:27:13.014 kernel: 0000000000000001 ffff8806bc50c920 ffff88066afa3e40 ffff88082b58a000
    15:27:13.014 kernel: Call Trace:
    15:27:13.014 kernel: [] register_disk+0x122/0x13a
    15:27:13.014 kernel: [] add_disk+0xaa/0x106
    15:27:13.014 kernel: [] sd_probe_async+0x198/0x25b
    15:27:13.014 kernel: [] async_thread+0x10c/0x20d
    15:27:13.014 kernel: [] ? default_wake_function+0x0/0xf
    15:27:13.014 kernel: [] ? async_thread+0x0/0x20d
    15:27:13.014 kernel: [] kthread+0x55/0x80
    15:27:13.014 kernel: [] child_rip+0xa/0x20
    15:27:13.014 kernel: [] ? kthread+0x0/0x80
    15:27:13.014 kernel: [] ? child_rip+0x0/0x20
    15:27:13.014 kernel: Code: c8 ff 80 e1 0c b9 00 00 00 00 0f 44 c1 41 83 cd ff 48 8d 7a 20 48 be ff ff ff ff 08 00 00 00 48 b9 00 00 00 00 08 00 00 00 eb 50 42 10 41 bd 01 00 00 00 eb db 4c 63 c2 4e 8d 04 c7 4d 8b 20
    15:27:13.015 kernel: RIP [] disk_part_iter_next+0x74/0xfd
    15:27:13.015 kernel: RSP
    15:27:13.015 kernel: CR2: 0000000000000010
    15:27:13.015 kernel: ---[ end trace 6104b56ef5590e25 ]---

    The problem is caused because the async scanning split in sd.c doesn't hold
    any reference to the device when it kicks off the async piece. What's
    happening is that an iSCSI disconnect is destorying the device again *before*
    the async sd scanning thread even starts. Fix this by taking a reference
    before starting the thread and dropping it again when the thread completes.

    Reported-by: Chris Webb
    Cc: Stable Tree
    Signed-off-by: James Bottomley

    James Bottomley
     
  • This patch fixes a memory leak in the libsrp function srp_ring_free().
    It is not documented whether or not this function should free the ring
    pointer itself. But the source code of the callers of this function
    (srp_target_alloc() and srp_target_free()) makes it clear that
    srp_ring_free() should deallocate the ring pointer itself. Furthermore,
    the patch below makes srp_ring_free() deallocate all memory allocated by
    srp_ring_alloc().

    This patch affects the ibmvstgt driver, which is the only in-tree driver
    that calls the srp_ring_free() function (indirectly).

    Signed-off-by: Bart Van Assche
    Acked-by: FUJITA Tomonori
    Cc: Stable Tree
    Signed-off-by: James Bottomley

    Bart Van Assche
     
  • bnx2i currently has a check for if a ep is properly bound, so if
    iscsi_queuecommand/xmit_task is called while there is no ep
    we will not queue IO.

    be2iscsi sends IO from queuecommand/xmit_task like how bnx2i does
    and needs a similar test. This patch has us just use the suspend_bit
    test for this.

    When ep_poll has succeeed iscsid will call conn_bind, the LLD will
    then call iscsi_conn_bind which will clear the suspend bit.
    When ep_disconnect is called (or if there is a conn error) we set
    the suspend bit. For the ep_disconnect case I am adding a helper
    in this patch that will take the session lock to make sure
    iscsi_queuecommand/xmit_task is not running and it will set
    the suspend bit.

    Signed-off-by: Mike Christie
    Signed-off-by: Jayamohan Kallickal
    Signed-off-by: James Bottomley

    Mike Christie
     
  • beiscsi does not need the iscsi scsi cmd processing. It does not
    even get this info on the completion path. This adds a function
    to just update the sequencing numbers and complete a task.

    Signed-off-by: Mike Christie
    Signed-off-by: Jayamohan Kallickal
    Signed-off-by: James Bottomley

    Mike Christie
     
  • Patch to add debugging stuff for rdac device handler.
    - Added a bit mask "module parameter" rdac_logging with 2 bits for each type
    of logging.
    - currently defined only two types of logging(failover and sense logging). Can
    be enhanced later if required.
    - By default only failover logging is enabled which is equivalent of current
    logging.

    Signed-off-by: Babu Moger
    Reviewed-by: Vijay Chauhan
    Reviewed-by: Bob Stankey
    Signed-off-by: James Bottomley

    Moger, Babu
     
  • Adding the code to read the debug information during initialization. This
    patch collects the information about storage and controllers during
    rdac_activate.

    Signed-off-by: Babu Moger
    Reviewed-by: Vijay Chauhan
    Reviewed-by: Bob Stankey
    Signed-off-by: James Bottomley

    Moger, Babu
     
  • Moving the initialization code from rdac_activate to rdac_bus_attach which is
    more efficient. We don't have to collect all the information during every
    activate.

    Signed-off-by: Babu Moger
    Reviewed-by: Vijay Chauhan
    Reviewed-by: Bob Stankey
    Signed-off-by: James Bottomley

    Moger, Babu
     
  • When the allocation fails in sg_build_indirect(), an oops happens in
    the error path. It's caused by an obvious typo.

    Signed-off-by: Michal Schmidt
    Reported-by: Bob Tracy
    Acked-by: Douglas Gilbert
    Cc: Stable Tree
    Signed-off-by: James Bottomley

    Michal Schmidt
     
  • These storage arrays can use RDAC when configured with 'linux' as the
    initiator.

    http://www.sun.com/storage/disk_systems/workgroup/2500/
    http://www.sun.com/storage/disk_systems/workgroup/2510/
    http://www.sun.com/storage/disk_systems/workgroup/2530/

    Signed-off-by: Charlie Brady
    Reviewed-by: Chandra Seetharaman
    Signed-off-by: James Bottomley

    Charlie Brady
     
  • Signed-off-by: Anil Ravindranath
    Signed-off-by: James Bottomley

    Anil Ravindranath
     
  • Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Giridhar Malavali
     
  • Original code would inadvertently skip the deferred
    fc_remote_port_delete() call for rports hanging off any vport.

    Signed-off-by: Andrew Vasquez
    Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Andrew Vasquez
     
  • * Consolidate vport-count processing.
    * Correct vp_idx restrictions during RSCN processing.
    * Push topology verification check to qla2x00_do_dpc_all_vps().
    * Don't skip vport full-login-lip/lip-reset mailbox handling.

    Signed-off-by: Andrew Vasquez
    Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Andrew Vasquez
     
  • Original code would break-out of loop after only one iteration.

    Signed-off-by: Lalit Chandivade
    Signed-off-by: Andrew Vasquez
    Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Lalit Chandivade
     
  • Signed-off-by: Andrew Vasquez
    Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Andrew Vasquez
     
  • In fabric-login based on iop BIT_8 firmware notifies presence of
    a FCP2 device and not necessarily a TAPE device. So instead of
    setting FCF_TAPE_PRESENT flag there we set it using
    scsi_device->type after mid-layer scan recognises "type" of the
    device.

    It also adds a new flag FCF_FCP2_DEVICE for any future use.

    Signed-off-by: Andrew Vasquez
    Signed-off-by: Giridhar Malavali
    Signed-off-by: James Bottomley

    Santosh Vernekar
     

11 Sep, 2009

21 commits

  • Update scsi_io_completion() such that it only fails requests till the
    next error boundary and retry the leftover. This enables block layer
    to merge requests with different failfast settings and still behave
    correctly on errors. Allow merge of requests of different failfast
    settings.

    As SCSI is currently the only subsystem which follows failfast status,
    there's no need to worry about other block drivers for now.

    Signed-off-by: Tejun Heo
    Cc: Niel Lambrechts
    Cc: James Bottomley
    Signed-off-by: Jens Axboe

    Tejun Heo
     
  • This fixes one cause of an occational problem when unloading
    libfc where the exchange manager pool doesn't have all items freed.

    The existing WARN_ON(mp->total_exches destructor function is used to inform us of
    the completion of the flush, and the fr_dev() is left
    NULL to indicate to fcoe_percpu_receive_thread() that
    the skb should be just freed. There's already a check
    for the lp being NULL which prints a message.
    We skip printing the message if the destructor is for flushing.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • I saw an lport debug message from the exchange manager saying:
    "lport 70500: Received response for out of range oxid:ffff"

    A trace showed this was a BA_RJT sent due to an incoming ABTS
    which arrived on an unknown exchange. So, the sender of the
    BA_RJT was in error, but in this case, both the initiator and
    responder were the same machine.

    The OX_ID and RX_ID should not have been reversed in this case.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • When an RSCN indicates changes to individual remote ports,
    don't blindly log them out and then back in. Instead, determine
    whether they're still in the directory, by doing GPN_ID.

    If that is successful, call login, which will send ADISC and reverify,
    otherwise, call logoff. Perhaps we should just delete the rport,
    not send LOGO, but it seems safer.

    Also, fix a possible issue where if a mix of records in the RSCN
    cause us to queue disc_ports for disc_single and then we decide
    to do full rediscovery, we leak memory for those disc_ports queued.

    So, go through the list of disc_ports even if doing full discovery.
    Free the disc_ports in any case. If any of the disc_single() calls
    return error, do a full discovery.

    The ability to fill in GPN_ID requests was added to fc_ct_fill().
    For this, it needs the FC_ID to be passed in as an arg.
    The did parameter for fc_elsct_send() is used for that, since the
    actual D_DID will always be 0xfffffc for all CT requests so far.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • The local port facility has been replying to ADISC requests without
    looking to see if the remote port is logged in. This is incorrect.
    An ADISC request requires PLOGI first. It should be rejected if
    the sending remote port is not logged in.

    This is like other incoming requests that require login, all of
    which should be handled in the remote port module.

    Move the ADISC request handling from fc_lport.c to fc_rport.c.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • When rport_login is called on an rport that is already thought
    to be logged in, use ADISC. If that fails, redo PLOGI.
    This is less disruptive after fabric changes that don't affect
    the state of the target.

    Implement the sending of ADISC via fc_els_fill.

    Add ADISC state to the rport state machine. This is entered from READY
    and returns to READY after successful completion. If it fails, the rport
    is either logged off and deleted or re-does PLOGI.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • fc_rport_logo_resp() had a call to fc_rport_enter_rtv() if the
    LOGO was accepted. This must've been a copy/paste mistake, but
    it didn't matter since we don't stay in the LOGO state long enough
    to hit this code.

    Change fc_rport_logo_resp() to just enter the delete state
    no matter what.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • After a quick link flap, a target was seen to send us a LOGO.
    Apparently, it saw an RSCN reporting that we had dropped out of the
    fabric after we had logged back into it.

    This is likely in larger fabrics (more than 2 FC switches) after
    a quick link flap at the initiator. Each link transition causes
    an port-specific RSCN to the target. After the link comes back up,
    the initiator successfully discovers and does a PLOGI to the target
    before the target sees the first RSCN reporting the initiator is gone,
    and it sends a LOGO. The target may see a subsequent RSCN saying the
    port is back, but probably wouldn't send a PLOGI and leaves it
    up to the initiator to re-login.

    An RSCN can be delayed by the switches due to software layers but a
    PLOGI is forwarded in hardware causing the PLOGI to beat the RSCN.

    If a remote port is in the discovered set and sends a LOGO, re-login to it.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • When receiving an ELS request, if the request isn't recognized,
    the unsupported operation error should be given even if the port
    is not found or not logged in.

    Also, the LOGO request shouldn't give the login-required explanation.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • libfc receives PLOGIs from switches which are trying to discover what
    kind of devices are present, and from other initiators to find out
    if we're a target.

    As an initiator, some argue we don't need to handle incoming PLOGI
    requests, and we currently reject them from unknown remote ports,
    but accept them is we're in the middle of a PLOGI to the remote port.

    For eventual target implementations, we want to handle them always.

    For incoming PLOGI, don't fail if the rport_priv doesn't exist.
    Just create it and go become READY without going through PRLI. If
    PRLI occurs, then our roles will be set and we'll become READY again.

    Also, allow incoming PRLI in RTV state.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • Improve lport and rport debug messages to indicate whether
    the response is LS_ACC, LS_RJT, closed, or timeout.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • The rport_lookup function must be called while holding the disc_mutex.
    Otherwise, the rdata could be deleted just after that by another thread.

    All callers now check the state after grabbing the rdata rp_mutex.
    Even though rport_lookup skips ports in DELETE state, it does that
    without holding the rdata rp_mutex, so that the state may change.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • This moves the remote port lookup for incoming ELS requests into
    fc_rport.c, in preparation for handing PLOGI and LOGO from
    unknown rports.

    This changes the arg to rport_recv_req from an rdata to an lport.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • Don't trust previous roles, reset them when we receive a PRLI.

    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Robert Love
     
  • Currently these values are initialized by the callers. This was exposed
    by a later patch that adds PLOGI request support. The patch failed to
    initialize the new remote port's roles and it caused problems. This patch
    has the rport_create routine initialize the identifiers and then the
    callers can override them with real values.

    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Robert Love
     
  • It's possible to "restart" discovery before it was started if
    an RSCN is received early enough. We were jumping to 0
    due to the disc_callback function pointer not getting set.

    Don't restart discovery if disc_callback is NULL.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • The discovery code had a special-case for the point-to-point mode,
    which used a bunch of code that wasn't really needed.

    Now that rport_create adds the rport to the discovery list,
    completely skip discovery for the point-to-point case.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • In fc_disc_gpn_ft_parse(), after fc_disc_done() is called, the
    disc state is changed by setting buf_len = 0. This is wrong
    since the discovery may have restarted. Instead, return
    after calling fc_disc_done.

    Also, return an error on memory allocation failure.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • Currently fc_disc_timeout() restarts discovery only if it is not pending.
    When the timer is scheduled, the discovery is left pending, so the
    timeout never restarts it.

    Fix by not checking for pending in the timeout handler.

    If discovery is stopped and restarted in the meantime, the timeout will
    be canceled.

    Also, when a new discovery is started, the retry count wasn't cleared.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • On some switches, an empty zone causes GPN_FT to be rejected
    with reason 9 (unable) explanation 7 (FC-4 types not registered),
    which causes discovery to be retried endlessly. Treat this as
    just an empty response and consider discovery complete.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt
     
  • Abhijeet Joglekar wrote: "In gpn_ft_resp, if the payload is short,
    or unexpected response or out of sequence frame, then we just
    return and do nothing. We should either enter fc_disc_done()
    with DISC_EV_FAIL which will then restart any queued discovery
    requests or call lport module which will reset local port,
    or we should call fc_disc_error() so that the gpn_ft is retried.

    The situation as is causes discovery to remain pending and never
    get restarted, in these rare cases. We saw this due to a coding
    bug in fc_disc before. The only ways it could happen would be
    bugs, packet corruption or an FC fabric problem.

    Change it to fail discovery. The local port will restart
    discovery, although it probably should just give up until
    the next link flap.

    Signed-off-by: Joe Eykholt
    Signed-off-by: Robert Love
    Signed-off-by: James Bottomley

    Joe Eykholt