13 Feb, 2014

1 commit

  • Russell King observed 'wierd' looking output from debugfs, and also suggested
    better ways of getting device names (use KBUILD_MODNAME, dev_name())

    This patch addresses these issues to make the debugfs output correct and better
    looking.

    While at it, replace seq_printf with seq_puts to remove the checkpatch.pl
    warnings.

    Reported-by: Russell King - ARM Linux
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     

01 May, 2013

2 commits

  • Add debugfs support to make it easier to print debug information
    about the dma-buf buffers.

    Cc: Dave Airlie
    [minor fixes on init and warning fix]
    Cc: Dan Carpenter
    [remove double unlock in fail case]
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     
  • For debugging purposes, it is useful to have a name-string added
    while exporting buffers. Hence, dma_buf_export() is replaced with
    dma_buf_export_named(), which additionally takes 'exp_name' as a
    parameter.

    For backward compatibility, and for lazy exporters who don't wish to
    name themselves, a #define dma_buf_export() is also made available,
    which adds a __FILE__ instead of 'exp_name'.

    Cc: Daniel Vetter
    [Thanks for the idea!]
    Reviewed-by: Daniel Vetter
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     

27 Feb, 2013

1 commit

  • All drivers which implement this need to have some sort of refcount to
    allow concurrent vmap usage. Hence implement this in the dma-buf core.

    To protect against concurrent calls we need a lock, which potentially
    causes new funny locking inversions. But this shouldn't be a problem
    for exporters with statically allocated backing storage, and more
    dynamic drivers have decent issues already anyway.

    Inspired by some refactoring patches from Aaron Plattner, who
    implemented the same idea, but only for drm/prime drivers.

    v2: Check in dma_buf_release that no dangling vmaps are left.
    Suggested by Aaron Plattner. We might want to do similar checks for
    attachments, but that's for another patch. Also fix up ERR_PTR return
    for vmap.

    v3: Check whether the passed-in vmap address matches with the cached
    one for vunmap. Eventually we might want to remove that parameter -
    compared to the kmap functions there's no need for the vaddr for
    unmapping. Suggested by Chris Wilson.

    v4: Fix a brown-paper-bag bug spotted by Aaron Plattner.

    Cc: Aaron Plattner
    Reviewed-by: Aaron Plattner
    Tested-by: Aaron Plattner
    Reviewed-by: Rob Clark
    Signed-off-by: Daniel Vetter
    Signed-off-by: Sumit Semwal

    Daniel Vetter
     

29 Jan, 2013

1 commit


09 Jan, 2013

1 commit


20 Dec, 2012

1 commit

  • Documentation says that code requiring dma-buf should add it to
    select, so inline fallbacks are not going to be used. A link error
    will make it obvious what went wrong, instead of silently doing
    nothing at runtime.

    Signed-off-by: Maarten Lankhorst
    Reviewed-by: Daniel Vetter
    Reviewed-by: Rob Clark
    Signed-off-by: Sumit Semwal

    Maarten Lankhorst
     

25 May, 2012

3 commits

  • Some minor inline documentation fixes for gaps resulting from new patches.

    Signed-off-by: Sumit Semwal
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     
  • The main requirement I have for this interface is for scanning out
    using the USB gpu devices. Since these devices have to read the
    framebuffer on updates and linearly compress it, using kmaps
    is a major overhead for every update.

    v2: fix warn issues pointed out by Sylwester Nawrocki.

    v3: fix compile !CONFIG_DMA_SHARED_BUFFER and add _GPL for now

    Signed-off-by: Dave Airlie
    Reviewed-by: Rob Clark
    Signed-off-by: Sumit Semwal

    Dave Airlie
     
  • Compared to Rob Clark's RFC I've ditched the prepare/finish hooks
    and corresponding ioctls on the dma_buf file. The major reason for
    that is that many people seem to be under the impression that this is
    also for synchronization with outstanding asynchronous processsing.
    I'm pretty massively opposed to this because:

    - It boils down reinventing a new rather general-purpose userspace
    synchronization interface. If we look at things like futexes, this
    is hard to get right.
    - Furthermore a lot of kernel code has to interact with this
    synchronization primitive. This smells a look like the dri1 hw_lock,
    a horror show I prefer not to reinvent.
    - Even more fun is that multiple different subsystems would interact
    here, so we have plenty of opportunities to create funny deadlock
    scenarios.

    I think synchronization is a wholesale different problem from data
    sharing and should be tackled as an orthogonal problem.

    Now we could demand that prepare/finish may only ensure cache
    coherency (as Rob intended), but that runs up into the next problem:
    We not only need mmap support to facilitate sw-only processing nodes
    in a pipeline (without jumping through hoops by importing the dma_buf
    into some sw-access only importer), which allows for a nicer
    ION->dma-buf upgrade path for existing Android userspace. We also need
    mmap support for existing importing subsystems to support existing
    userspace libraries. And a loot of these subsystems are expected to
    export coherent userspace mappings.

    So prepare/finish can only ever be optional and the exporter /needs/
    to support coherent mappings. Given that mmap access is always
    somewhat fallback-y in nature I've decided to drop this optimization,
    instead of just making it optional. If we demonstrate a clear need for
    this, supported by benchmark results, we can always add it in again
    later as an optional extension.

    Other differences compared to Rob's RFC is the above mentioned support
    for mapping a dma-buf through facilities provided by the importer.
    Which results in mmap support no longer being optional.

    Note that this dma-buf mmap patch does _not_ support every possible
    insanity an existing subsystem could pull of with mmap: Because it
    does not allow to intercept pagefaults and shoot down ptes importing
    subsystems can't add some magic of their own at these points (e.g. to
    automatically synchronize with outstanding rendering or set up some
    special resources). I've done a cursory read through a few mmap
    implementions of various subsytems and I'm hopeful that we can avoid
    this (and the complexity it'd bring with it).

    Additonally I've extended the documentation a bit to explain the hows
    and whys of this mmap extension.

    In case we ever want to add support for explicitly cache maneged
    userspace mmap with a prepare/finish ioctl pair, we could specify that
    userspace needs to mmap a different part of the dma_buf, e.g. the
    range starting at dma_buf->size up to dma_buf->size*2. This works
    because the size of a dma_buf is invariant over it's lifetime. The
    exporter would obviously need to fall back to coherent mappings for
    both ranges if a legacy clients maps the coherent range and the
    architecture cannot suppor conflicting caching policies. Also, this
    would obviously be optional and userspace needs to be able to fall
    back to coherent mappings.

    v2:
    - Spelling fixes from Rob Clark.
    - Compile fix for !DMA_BUF from Rob Clark.
    - Extend commit message to explain how explicitly cache managed mmap
    support could be added later.
    - Extend the documentation with implementations notes for exporters
    that need to manually fake coherency.

    v3:
    - dma_buf pointer initialization goof-up noticed by Rebecca Schultz
    Zavin.

    Cc: Rob Clark
    Cc: Rebecca Schultz Zavin
    Acked-by: Rob Clark
    Signed-Off-by: Daniel Vetter
    Signed-off-by: Sumit Semwal

    Daniel Vetter
     

29 Mar, 2012

1 commit

  • Pull dma-buf updates from Sumit Semwal:
    "This includes the following key items:

    - kernel cpu access support,
    - flag-passing to dma_buf_fd,
    - relevant Documentation updates, and
    - some minor cleanups and fixes.

    These changes are needed for the drm prime/dma-buf interface code that
    Dave Airlie plans to submit in this merge window."

    * 'for-linus-3.4' of git://git.linaro.org/people/sumitsemwal/linux-dma-buf:
    dma-buf: correct dummy function declarations.
    dma-buf: document fd flags and O_CLOEXEC requirement
    dma_buf: Add documentation for the new cpu access support
    dma-buf: add support for kernel cpu access
    dma-buf: don't hold the mutex around map/unmap calls
    dma-buf: add get_dma_buf()
    dma-buf: pass flags into dma_buf_fd.
    dma-buf: add dma_data_direction to unmap dma_buf_op
    dma-buf: Move code out of mutex-protected section in dma_buf_attach()
    dma-buf: Return error instead of using a goto statement when possible
    dma-buf: Remove unneeded sanity checks
    dma-buf: Constify ops argument to dma_buf_export()

    Linus Torvalds
     

26 Mar, 2012

6 commits

  • Dummy functions for the newly added cpu access ops need variable names for
    arguments.
    Also, the introduction of flags in dma_buf_fd needs to be added to dummy
    functions as well.

    Signed-off-by: Sumit Semwal
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     
  • Big differences to other contenders in the field (like ion) is
    that this also supports highmem, so we have to split up the cpu
    access from the kernel side into a prepare and a kmap step.

    Prepare is allowed to fail and should do everything required so that
    the kmap calls can succeed (like swapin/backing storage allocation,
    flushing, ...).

    More in-depth explanations will follow in the follow-up documentation
    patch.

    Changes in v2:

    - Clear up begin_cpu_access confusion noticed by Sumit Semwal.
    - Don't automatically fallback from the _atomic variants to the
    non-atomic variants. The _atomic callbacks are not allowed to
    sleep, so we want exporters to make this decision explicit. The
    function signatures are explicit, so simpler exporters can still
    use the same function for both.
    - Make the unmap functions optional. Simpler exporters with permanent
    mappings don't need to do anything at unmap time.

    Changes in v3:

    - Adjust the WARN_ON checks for the new ->ops functions as suggested
    by Rob Clark and Sumit Semwal.
    - Rebased on top of latest dma-buf-next git.

    Changes in v4:

    - Fixup a missing - in a return -EINVAL; statement.

    Signed-Off-by: Daniel Vetter
    Signed-off-by: Rob Clark
    Signed-off-by: Sumit Semwal

    Daniel Vetter
     
  • The mutex protects the attachment list and hence needs to be held
    around the callbakc to the exporters (optional) attach/detach
    functions.

    Holding the mutex around the map/unmap calls doesn't protect any
    dma_buf state. Exporters need to properly protect any of their own
    state anyway (to protect against calls from their own interfaces).
    So this only makes the locking messier (and lockdep easier to anger).

    Therefore let's just drop this.

    v2: Rebased on top of latest dma-buf-next git.

    Signed-off-by: Daniel Vetter
    Reviewed-by: Rob Clark
    Signed-off-by: Sumit Semwal

    Daniel Vetter
     
  • Works in a similar way to get_file(), and is needed in cases such as
    when the exporter needs to also keep a reference to the dmabuf (that
    is later released with a dma_buf_put()), and possibly other similar
    cases.

    Signed-off-by: Rob Clark
    Reviewed-by: Dave Airlie
    Reviewed-by: Kyungmin Park
    Signed-off-by: Sumit Semwal

    Rob Clark
     
  • We need to pass the flags into dma_buf_fd at this point,
    so the flags end up doing the right thing for O_CLOEXEC.

    Signed-off-by: Dave Airlie
    Signed-off-by: Rob Clark
    Signed-off-by: Sumit Semwal

    Dave Airlie
     
  • Some exporters may use DMA map/unmap APIs in dma-buf ops, which require
    enum dma_data_direction for both map and unmap operations.

    Thus, the unmap dma_buf_op also needs to have enum dma_data_direction as
    a parameter.

    Reported-by: Tomasz Stanislawski
    Signed-off-by: Sumit Semwal
    Reviewed-by: Daniel Vetter
    Signed-off-by: Sumit Semwal

    Sumit Semwal
     

19 Mar, 2012

1 commit


16 Mar, 2012

1 commit

  • The header includes a lot of stuff, and
    it in turn gets a lot of use just for the basic "struct device"
    which appears so often.

    Clean up the users as follows:

    1) For those headers only needing "struct device" as a pointer
    in fcn args, replace the include with exactly that.

    2) For headers not really using anything from device.h, simply
    delete the include altogether.

    3) For headers relying on getting device.h implicitly before
    being included themselves, now explicitly include device.h

    4) For files in which doing #1 or #2 uncovers an implicit
    dependency on some other header, fix by explicitly adding
    the required header(s).

    Any C files that were implicitly relying on device.h to be
    present have already been dealt with in advance.

    Total removals from #1 and #2: 51. Total additions coming
    from #3: 9. Total other implicit dependencies from #4: 7.

    As of 3.3-rc1, there were 110, so a net removal of 42 gives
    about a 38% reduction in device.h presence in include/*

    Signed-off-by: Paul Gortmaker

    Paul Gortmaker
     

06 Jan, 2012

1 commit

  • This is the first step in defining a dma buffer sharing mechanism.

    A new buffer object dma_buf is added, with operations and API to allow easy
    sharing of this buffer object across devices.

    The framework allows:
    - creation of a buffer object, its association with a file pointer, and
    associated allocator-defined operations on that buffer. This operation is
    called the 'export' operation.
    - different devices to 'attach' themselves to this exported buffer object, to
    facilitate backing storage negotiation, using dma_buf_attach() API.
    - the exported buffer object to be shared with the other entity by asking for
    its 'file-descriptor (fd)', and sharing the fd across.
    - a received fd to get the buffer object back, where it can be accessed using
    the associated exporter-defined operations.
    - the exporter and user to share the scatterlist associated with this buffer
    object using map_dma_buf and unmap_dma_buf operations.

    Atleast one 'attach()' call is required to be made prior to calling the
    map_dma_buf() operation.

    Couple of building blocks in map_dma_buf() are added to ease introduction
    of sync'ing across exporter and users, and late allocation by the exporter.

    For this first version, this framework will work with certain conditions:
    - *ONLY* exporter will be allowed to mmap to userspace (outside of this
    framework - mmap is not a buffer object operation),
    - currently, *ONLY* users that do not need CPU access to the buffer are
    allowed.

    More details are there in the documentation patch.

    This is based on design suggestions from many people at the mini-summits[1],
    most notably from Arnd Bergmann , Rob Clark and
    Daniel Vetter .

    The implementation is inspired from proof-of-concept patch-set from
    Tomasz Stanislawski , who demonstrated buffer sharing
    between two v4l2 devices. [2]

    [1]: https://wiki.linaro.org/OfficeofCTO/MemoryManagement
    [2]: http://lwn.net/Articles/454389

    Signed-off-by: Sumit Semwal
    Signed-off-by: Sumit Semwal
    Reviewed-by: Daniel Vetter
    Reviewed-by: Dave Airlie
    Reviewed-and-Tested-by: Rob Clark
    Signed-off-by: Dave Airlie

    Sumit Semwal