26 Jul, 2008

2 commits

  • fs/exec.c used to need mman.h pagemap.h swap.h and rmap.h when it did
    mm-ish stuff in install_arg_page(); but no need for them after 2.6.22.

    [akpm@linux-foundation.org: unbreak arm]
    Signed-off-by: Hugh Dickins
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Hugh Dickins
     
  • Replace the private BE16/BE32/BE64 macros with direct calls to
    get_unaligned_be16/32/64.

    Signed-off-by: Harvey Harrison
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Harvey Harrison
     

25 Jul, 2008

38 commits

  • This fixes the following compile error caused by commit
    f9247273cb69ba101877e946d2d83044409cc8c5 ("UFS: add const to parser
    token table"):

    CC fs/nfs/nfsroot.o
    /home/bunk/linux/kernel-2.6/git/linux-2.6/fs/nfs/nfsroot.c:130: error: tokens causes a section type conflict
    make[3]: *** [fs/nfs/nfsroot.o] Error 1

    Signed-off-by: Adrian Bunk
    Signed-off-by: Linus Torvalds

    Adrian Bunk
     
  • This patch adds a "const" to the parser token table. I've done an
    allmodconfig build to see if this produces any warnings/failures and the
    patch includes a fix for the only warning that was produced.

    Signed-off-by: Steven Whitehouse
    Acked-by: Alexander Viro
    Acked-by: Evgeniy Dushistov
    Signed-off-by: Linus Torvalds

    Steven Whitehouse
     
  • The ioctls AUTOFS_IOC_TOGGLEREGHOST and AUTOFS_IOC_ASKREGHOST were added
    several years ago but what they were intended for has never been
    implemented (as far as I'm aware noone uses them) so remove them.

    Signed-off-by: Ian Kent
    Reviewed-by: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • This patch re-orgnirzes the checking for and waiting on active expires and
    elininates redundant checks.

    Signed-off-by: Ian Kent
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • Appologies, somehow I seem to have sent an out dated version of this
    patch. Here is an additional patch that brings the patch up to date.

    Signed-off-by: Ian Kent
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • For direct and offset type mounts that are covered by another mount we
    cannot check the AUTOFS_INF_EXPIRING flag during a path walk which leads
    to lookups walking into an expiring mount while it is being expired.

    For example, for the direct multi-mount map entry with a couple of
    offsets:

    /race/mm1 / :/
    /om1 :/
    /om2 :/

    an autofs trigger mount is mounted on /race/mm1 and when accessed it is
    over mounted and trigger mounts made for /race/mm1/om1 and /race/mm1/om2.
    So it isn't possible for path walks to see the expiring flag at all and
    they happily walk into the file system while it is expiring.

    When expiring these mounts follow_down() must stop at the autofs mount and
    all processes must block in the ->follow_link() method (except the daemon)
    until the expire is complete. This is done by decrementing the d_mounted
    field of the autofs trigger mount root dentry until the expire is
    completed. In ->follow_link() all processes wait on the expire and the
    mount following is completed for the daemon until the expire is complete.

    Signed-off-by: Ian Kent
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • The selection of a dentry for expiration and the setting of the
    AUTOFS_INF_EXPIRING flag isn't done atomically which can lead to lookups
    walking into an expiring mount.

    What happens is that an expire is initiated by the daemon and a dentry is
    selected for expire but, since there is no lock held between the selection
    and setting of the expiring flag, a process may find the flag clear and
    continue walking into the mount tree at the same time the daemon attempts
    the expire it.

    Signed-off-by: Ian Kent
    Reviewed-by: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • There are two cases for which a dentry that has a pending mount request
    does not wait for completion. One is via autofs4_revalidate() and the
    other via autofs4_follow_link().

    In revalidate, after the mount point directory is created, but before the
    mount is done, the check in try_to_fill_dentry() can can fail to send the
    dentry to the wait queue since the dentry is positive and the lookup flags
    may contain only LOOKUP_FOLLOW. Although we don't trigger a mount for the
    LOOKUP_FOLLOW flag, if ther's one pending we might as well wait and use
    the mounted dentry for the lookup.

    In autofs4_follow_link() the dentry is not checked to see if it is pending
    so it may fail to call try_to_fill_dentry() and not wait for mount
    completion.

    A dentry that is pending must always be sent to the wait queue.

    Signed-off-by: Ian Kent
    Reviewed-by: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • The mount triggering functionality of readdir and related functions is no
    longer used (and is quite broken as well). The unused portions have been
    removed.

    Signed-off-by: Ian Kent
    Reviewed-by: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • We have been seeing mount requests comming to the automount daemon for
    keys of the form "/" which are lookups for
    invalid map keys. But we can check for this in the kernel module and
    return a fail immediately, without having to send a request to the daemon.

    It is possible to recognise these requests are invalid based on whether
    the request dentry is negative and its relation to the autofs file system
    root.

    For example, given the indirect multi-mount map entry:

    idm1 \
    /mm1 :/
    /mm2 :/

    For a request to mount idm1, IS_ROOT((idm1)->d_parent) will be always be
    true and the dentry may be negative. But directories idm1/mm1 and
    idm1/mm2 will always be created as part of the mount request for idm1. So
    any mount request within idm1 itself must have a positive dentry otherwise
    the map key is invalid.

    In version 4 these multi-mount entries are all mounted and umounted as a
    single request and in version 5 the directories idm1/mm1 and idm1/mm2 are
    created and an autofs fs mounted on them to act as a mount trigger so the
    above is also true.

    This also holds true for the autofs version 4 pseudo direct mount feature.
    When this feature is used without the "--ghost" option automount(8) will
    create internal submounts as we go down the map key paths which are
    essentially normal indirect mounts for which the above holds. If the
    "--ghost" option is given the directories for map keys are created at
    daemon startup so valid map entries correspond to postive dentries in the
    autofs fs.

    autofs version 5 direct mount maps are similar except that the IS_ROOT
    check is not needed. This has been addressed in a previous patch tittled
    "autofs4 - detect invalid direct mount requests".

    For example, given the direct multi-mount map entry:

    /test/dm1 \
    /mm1 :/
    /mm2 :/

    An autofs fs is mounted on /test/dm1 as a trigger mount and when a mount
    is triggered for /test/dm1, the multi-mount offset directories
    /test/dm1/mm1 and /test/dm1/mm2 are created and an autofs fs is mounted on
    them to act as mount triggers. So valid direct mount requests must always
    have a positive dentry if they correspond to a valid map entry.

    Signed-off-by: Ian Kent
    Acked-by: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • autofs v5 direct and offset mounts within an autofs filesystem are
    triggered by existing autofs triger mounts so the mount point dentry must
    be positive. If the mount point dentry is negative then the trigger
    doesn't exist so we can return fail immediately.

    Signed-off-by: Ian Kent
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • If an autofs mount becomes catatonic before autofs4_wait_release() is
    called the wait queue counter will not be decremented down to zero and the
    entry will never be freed. There are also races decrementing the wait
    counter in the wait release function. To deal with this the counter needs
    to be updated while holding the wait queue mutex and waiters need to be
    woken up unconditionally when the wait is removed from the queue to ensure
    we eventually free the wait.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • It is possible for an autofs mount to become catatonic (and for the daemon
    communication pipe to become NULL) after a wait has been initiallized but
    before the request has been sent to the daemon. We need to check for this
    before sending the request packet.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • It see that the patch tittled "autofs4 - fix pending mount race" is
    missing a change that I had recently made.

    It's missing a kfree for the case mutex_lock_interruptible() fails
    to aquire the wait queue mutex.

    Signed-off-by: Ian Kent
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • Close a race between a pending mount that is about to finish and a new
    lookup for the same directory.

    Process P1 triggers a mount of directory foo. It sets
    DCACHE_AUTOFS_PENDING in the ->lookup routine, creates a waitq entry for
    'foo', and calls out to the daemon to perform the mount. The autofs
    daemon will then create the directory 'foo', using a new dentry that will
    be hashed in the dcache.

    Before the mount completes, another process, P2, tries to walk into the
    'foo' directory. The vfs path walking code finds an entry for 'foo' and
    calls the revalidate method. Revalidate finds that the entry is not
    PENDING (because PENDING was never set on the dentry created by the
    mkdir), but it does find the directory is empty. Revalidate calls
    try_to_fill_dentry, which sets the PENDING flag and then calls into the
    autofs4 wait code to trigger or wait for a mount of 'foo'. The wait code
    finds the entry for 'foo' and goes to sleep waiting for the completion of
    the mount.

    Yet another process, P3, tries to walk into the 'foo' directory. This
    process again finds a dentry in the dcache for 'foo', and calls into the
    autofs revalidate code.

    The revalidate code finds that the PENDING flag is set, and so calls
    try_to_fill_dentry.

    a) try_to_fill_dentry sets the PENDING flag redundantly for this
    dentry, then calls into the autofs4 wait code.

    b) the autofs4 wait code takes the waitq mutex and searches for an
    entry for 'foo'

    Between a and b, P1 is woken up because the mount completed. P1 takes the
    wait queue mutex, clears the PENDING flag from the dentry, and removes the
    waitqueue entry for 'foo' from the list.

    When it releases the waitq mutex, P3 (eventually) acquires it. At this
    time, it looks for an existing waitq for 'foo', finds none, and so creates
    a new one and calls out to the daemon to mount the 'foo' directory.

    Now, the reason that three processes are required to trigger this race is
    that, because the PENDING flag is not set on the dentry created by mkdir,
    the window for the race would be way to slim for it to ever occur.
    Basically, between the testing of d_mountpoint(dentry) and the taking of
    the waitq mutex, the mount would have to complete and the daemon would
    have to be woken up, and that in turn would have to wake up P1. This is
    simply impossible. Add the third process, though, and it becomes slightly
    more likely.

    Signed-off-by: Jeff Moyer
    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • The autofs4_catatonic_mode() function accesses the wait queue without any
    locking but can be called at any time. This could lead to a possible
    double free of the name field of the wait and a double fput of the daemon
    communication pipe or an fput of a NULL file pointer.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • The autofs_wait_queue already contains all of the fields of the
    struct qstr, so change it into a qstr.

    This patch, from Jeff Moyer, has been modified a liitle by myself.

    Signed-off-by: Jeff Moyer
    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Jeff Moyer
     
  • When an open(2) call is made on an autofs mount point directory that
    already exists and the O_DIRECTORY flag is not used the needed mount
    callback to the daemon is not done. This leads to the path walk
    continuing resulting in a callback to the daemon with an incorrect
    key. open(2) is called without O_DIRECTORY by the "find" utility but
    this should be handled properly anyway.

    This happens because autofs needs to use the lookup flags to decide
    when to callback to the daemon to perform a mount to prevent mount
    storms. For example, an autofs indirect mount map that has the "browse"
    option will have the mount point directories are pre-created and the
    stat(2) call made by a color ls against each directory will cause all
    these directories to be mounted. It is unfortunate we need to resort
    to this but mount maps can be quite large. Additionally, if a user
    manually umounts an autofs indirect mount the directory isn't removed
    which also leads to this situation.

    To resolve this autofs needs to use the lookup intent flags to enable
    it to make this decision. This patch adds this check and triggers a
    call back if any of the lookup intent flags are set as all these calls
    warrant a mount attempt be requested.

    I know that external VFS code which uses the lookup flags is something
    that the VFS would like to eliminate but I have no choice as I can't
    see any other way to do this. A VFS dentry or inode operation callback
    which returns the lookup "type" (requires a definition) would be
    sufficient. But this change is needed now and I'm not aware of the form
    that coming VFS changes will take so I'm not willing to propose anything
    along these lines.

    If anyone can provide an alternate method I would be happy to use it.

    [akpm@linux-foundation.org: fix build for concurrent VFS changes]
    Signed-off-by: Ian Kent
    Cc: Al Viro
    Cc: Jeff Moyer
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • Since we now delay hashing of dentrys until the ->mkdir() call, droping
    and re-taking the directory mutex within the ->lookup() function when we
    are being called by user space is not needed. This can lead to a race
    when other processes are attempting to access the same directory during
    mount point directory creation.

    In this case we need to hang onto the mutex to ensure we don't get user
    processes trying to create a mount request for a newly created dentry
    after the mount point entry has already been created. This ensures that
    when we need to check a dentry passed to autofs4_wait(), if it is hashed,
    it is always the mount point dentry and not a new dentry created by
    another lookup during directory creation.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • The length of the symlink name has been moved but it needs to be set
    before allocating space for it in the dentry info struct. This corrects a
    mistake in a recent patch.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • A while ago a patch to resolve a deadlock during directory creation was
    merged. This delayed the hashing of lookup dentrys until the ->mkdir()
    (or ->symlink()) operation completed to ensure we always went through
    ->lookup() instead of also having processes go through ->revalidate() so
    our VFS locking remained consistent.

    Now we are seeing a couple of side affects of that change in situations
    with heavy mount activity.

    Two cases have been identified:

    1) When a mount request is triggered, due to the delayed hashing, the
    directory created by user space for the mount point doesn't have the
    DCACHE_AUTOFS_PENDING flag set. In the case of an autofs multi-mount
    where a tree of mount point directories are created this can lead to
    the path walk continuing rather than the dentry being sent to the wait
    queue to wait for request completion. This is because, if the pending
    flag isn't set, the criteria for deciding this is a mount in progress
    fails to hold, namely that the dentry is not a mount point and has no
    subdirectories.

    2) A mount request dentry is initially created negative and unhashed.
    It remains this way until the ->mkdir() callback completes. Since it
    is unhashed a fresh dentry is used when the user space mount request
    creates the mount point directory. This leaves the original dentry
    negative and unhashed. But revalidate has no way to tell the VFS that
    the dentry has changed, other than to force another ->lookup() by
    returning false, which is at best wastefull and at worst not possible.
    This results in an -ENOENT return from the original path walk when in
    fact the mount succeeded.

    To resolve this we need to ensure that the same dentry is used in all
    calls to ->lookup() during the course of a mount request. This patch
    achieves that by adding the initial dentry to a look aside list and
    removes it at ->mkdir() or ->symlink() completion (or when the dentry is
    released), since these are the only create operations autofs4 supports.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • This patch series enables the use of a single dentry for lookups prior to
    the dentry being hashed and so we no longer need to redo the lookup. This
    patch reverts the patch of commit
    033790449ba9c4dcf8478a87693d33df625c23b5.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • Correct the error of making a positive dentry negative after it has been
    instantiated.

    The code that makes this error attempts to re-use the dentry from a
    concurrent expire and mount to resolve a race and the dentry used for the
    lookup must be negative for mounts to trigger in the required cases. The
    fact is that the dentry doesn't need to be re-used because all that is
    needed is to preserve the flag that indicates an expire is still
    incomplete at the time of the mount request.

    This change uses the the dentry to check the flag and wait for the expire
    to complete then discards it instead of attempting to re-use it.

    Signed-off-by: Ian Kent
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ian Kent
     
  • There is no good reason to immediately open the lower file, and that can
    cause problems with files that the user does not intend to immediately
    open, such as device nodes.

    This patch removes the persistent file open from the interpose step and
    pushes that to the locations where eCryptfs really does need the lower
    persistent file, such as just before reading or writing the metadata
    stored in the lower file header.

    Two functions are jumping to out_dput when they should just be jumping to
    out on error paths. This patch also fixes these.

    Signed-off-by: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Michael Halcrow
     
  • When creating device nodes, eCryptfs needs to delay actually opening the lower
    persistent file until an application tries to open. Device handles may not be
    backed by anything when they first come into existence.

    [Valdis.Kletnieks@vt.edu: build fix]
    Signed-off-by: Michael Halcrow
    Cc:
    Signed-off-by: Linus Torvalds

    Michael Halcrow
     
  • Fixe sparse warnings:
    fs/ecryptfs/inode.c:368:15: warning: cast to restricted __be64
    fs/ecryptfs/mmap.c:385:12: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/mmap.c:385:12: expected unsigned long long [unsigned] [assigned] [usertype] file_size
    fs/ecryptfs/mmap.c:385:12: got restricted __be64 [usertype]
    fs/ecryptfs/mmap.c:428:12: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/mmap.c:428:12: expected unsigned long long [unsigned] [assigned] [usertype] file_size
    fs/ecryptfs/mmap.c:428:12: got restricted __be64 [usertype]

    Signed-off-by: Harvey Harrison
    Cc: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Harvey Harrison
     
  • Fixes the following sparse warnings:
    fs/ecryptfs/crypto.c:1036:8: warning: cast to restricted __be32
    fs/ecryptfs/crypto.c:1038:8: warning: cast to restricted __be32
    fs/ecryptfs/crypto.c:1077:10: warning: cast to restricted __be32
    fs/ecryptfs/crypto.c:1103:6: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/crypto.c:1105:6: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/crypto.c:1124:8: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/crypto.c:1241:21: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/crypto.c:1244:30: warning: incorrect type in assignment (different base types)
    fs/ecryptfs/crypto.c:1414:23: warning: cast to restricted __be32
    fs/ecryptfs/crypto.c:1417:32: warning: cast to restricted __be16

    Signed-off-by: Harvey Harrison
    Cc: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Harvey Harrison
     
  • Clean up overcomplicated string copy, which also gets rid of this
    bogus warning:

    fs/ecryptfs/main.c: In function 'ecryptfs_parse_options':
    include/asm/arch/string_32.h:75: warning: array subscript is above array bounds

    Signed-off-by: Miklos Szeredi
    Cc: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Miklos Szeredi
     
  • Mounting with invalid key signatures should probably fail, if they were
    specifically requested but not available.

    Also fix case checks in process_request_key_err() for the right sign of
    the errnos, as spotted by Jan Tluka.

    Signed-off-by: Eric Sandeen
    Reviewed-by: Jan Tluka
    Acked-by: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Eric Sandeen
     
  • The userspace eCryptfs daemon sends HELO and QUIT messages to the kernel
    for per-user daemon (un)registration. These messages are required when
    netlink is used as the transport, but (un)registration is handled by
    opening and closing the device file when miscdev is the transport. These
    messages should be discarded in the miscdev transport so that a daemon
    isn't registered twice.

    Signed-off-by: Tyler Hicks
    Cc: Michael Halcrow
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Tyler Hicks
     
  • eCryptfs would really like to have read-write access to all files in the
    lower filesystem. Right now, the persistent lower file may be opened
    read-only if the attempt to open it read-write fails. One way to keep
    from having to do that is to have a privileged kthread that can open the
    lower persistent file on behalf of the user opening the eCryptfs file;
    this patch implements this functionality.

    This patch will properly allow a less-privileged user to open the eCryptfs
    file, followed by a more-privileged user opening the eCryptfs file, with
    the first user only being able to read and the second user being able to
    both read and write. eCryptfs currently does this wrong; it will wind up
    calling vfs_write() on a file that was opened read-only. This is fixed in
    this patch.

    Signed-off-by: Michael Halcrow
    Cc: Dave Kleikamp
    Cc: Serge Hallyn
    Cc: Eric Sandeen
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Michael Halcrow
     
  • Remove the size parameter from the new epoll_create syscall and renames the
    syscall itself. The updated test program follows.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include
    #include

    #ifndef __NR_epoll_create2
    # ifdef __x86_64__
    # define __NR_epoll_create2 291
    # elif defined __i386__
    # define __NR_epoll_create2 329
    # else
    # error "need __NR_epoll_create2"
    # endif
    #endif

    #define EPOLL_CLOEXEC O_CLOEXEC

    int
    main (void)
    {
    int fd = syscall (__NR_epoll_create2, 0);
    if (fd == -1)
    {
    puts ("epoll_create2(0) failed");
    return 1;
    }
    int coe = fcntl (fd, F_GETFD);
    if (coe == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (coe & FD_CLOEXEC)
    {
    puts ("epoll_create2(0) set close-on-exec flag");
    return 1;
    }
    close (fd);

    fd = syscall (__NR_epoll_create2, EPOLL_CLOEXEC);
    if (fd == -1)
    {
    puts ("epoll_create2(EPOLL_CLOEXEC) failed");
    return 1;
    }
    coe = fcntl (fd, F_GETFD);
    if (coe == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((coe & FD_CLOEXEC) == 0)
    {
    puts ("epoll_create2(EPOLL_CLOEXEC) set close-on-exec flag");
    return 1;
    }
    close (fd);

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Cc:
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds test that ensure the boundary conditions for the various
    constants introduced in the previous patches is met. No code is generated.

    [akpm@linux-foundation.org: fix alpha]
    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds non-blocking support for inotify_init1. The
    additional changes needed are minimal.

    The following test must be adjusted for architectures other than x86 and
    x86-64 and in case the syscall numbers changed.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include

    #ifndef __NR_inotify_init1
    # ifdef __x86_64__
    # define __NR_inotify_init1 294
    # elif defined __i386__
    # define __NR_inotify_init1 332
    # else
    # error "need __NR_inotify_init1"
    # endif
    #endif

    #define IN_NONBLOCK O_NONBLOCK

    int
    main (void)
    {
    int fd = syscall (__NR_inotify_init1, 0);
    if (fd == -1)
    {
    puts ("inotify_init1(0) failed");
    return 1;
    }
    int fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (fl & O_NONBLOCK)
    {
    puts ("inotify_init1(0) set non-blocking mode");
    return 1;
    }
    close (fd);

    fd = syscall (__NR_inotify_init1, IN_NONBLOCK);
    if (fd == -1)
    {
    puts ("inotify_init1(IN_NONBLOCK) failed");
    return 1;
    }
    fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((fl & O_NONBLOCK) == 0)
    {
    puts ("inotify_init1(IN_NONBLOCK) set non-blocking mode");
    return 1;
    }
    close (fd);

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds O_NONBLOCK support to pipe2. It is minimally more involved
    than the patches for eventfd et.al but still trivial. The interfaces of the
    create_write_pipe and create_read_pipe helper functions were changed and the
    one other caller as well.

    The following test must be adjusted for architectures other than x86 and
    x86-64 and in case the syscall numbers changed.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include

    #ifndef __NR_pipe2
    # ifdef __x86_64__
    # define __NR_pipe2 293
    # elif defined __i386__
    # define __NR_pipe2 331
    # else
    # error "need __NR_pipe2"
    # endif
    #endif

    int
    main (void)
    {
    int fds[2];
    if (syscall (__NR_pipe2, fds, 0) == -1)
    {
    puts ("pipe2(0) failed");
    return 1;
    }
    for (int i = 0; i < 2; ++i)
    {
    int fl = fcntl (fds[i], F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (fl & O_NONBLOCK)
    {
    printf ("pipe2(0) set non-blocking mode for fds[%d]\n", i);
    return 1;
    }
    close (fds[i]);
    }

    if (syscall (__NR_pipe2, fds, O_NONBLOCK) == -1)
    {
    puts ("pipe2(O_NONBLOCK) failed");
    return 1;
    }
    for (int i = 0; i < 2; ++i)
    {
    int fl = fcntl (fds[i], F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((fl & O_NONBLOCK) == 0)
    {
    printf ("pipe2(O_NONBLOCK) does not set non-blocking mode for fds[%d]\n", i);
    return 1;
    }
    close (fds[i]);
    }

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds support for the TFD_NONBLOCK flag to timerfd_create. The
    additional changes needed are minimal.

    The following test must be adjusted for architectures other than x86 and
    x86-64 and in case the syscall numbers changed.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include
    #include

    #ifndef __NR_timerfd_create
    # ifdef __x86_64__
    # define __NR_timerfd_create 283
    # elif defined __i386__
    # define __NR_timerfd_create 322
    # else
    # error "need __NR_timerfd_create"
    # endif
    #endif

    #define TFD_NONBLOCK O_NONBLOCK

    int
    main (void)
    {
    int fd = syscall (__NR_timerfd_create, CLOCK_REALTIME, 0);
    if (fd == -1)
    {
    puts ("timerfd_create(0) failed");
    return 1;
    }
    int fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (fl & O_NONBLOCK)
    {
    puts ("timerfd_create(0) set non-blocking mode");
    return 1;
    }
    close (fd);

    fd = syscall (__NR_timerfd_create, CLOCK_REALTIME, TFD_NONBLOCK);
    if (fd == -1)
    {
    puts ("timerfd_create(TFD_NONBLOCK) failed");
    return 1;
    }
    fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((fl & O_NONBLOCK) == 0)
    {
    puts ("timerfd_create(TFD_NONBLOCK) set non-blocking mode");
    return 1;
    }
    close (fd);

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds support for the EFD_NONBLOCK flag to eventfd2. The
    additional changes needed are minimal.

    The following test must be adjusted for architectures other than x86 and
    x86-64 and in case the syscall numbers changed.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include

    #ifndef __NR_eventfd2
    # ifdef __x86_64__
    # define __NR_eventfd2 290
    # elif defined __i386__
    # define __NR_eventfd2 328
    # else
    # error "need __NR_eventfd2"
    # endif
    #endif

    #define EFD_NONBLOCK O_NONBLOCK

    int
    main (void)
    {
    int fd = syscall (__NR_eventfd2, 1, 0);
    if (fd == -1)
    {
    puts ("eventfd2(0) failed");
    return 1;
    }
    int fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (fl & O_NONBLOCK)
    {
    puts ("eventfd2(0) sets non-blocking mode");
    return 1;
    }
    close (fd);

    fd = syscall (__NR_eventfd2, 1, EFD_NONBLOCK);
    if (fd == -1)
    {
    puts ("eventfd2(EFD_NONBLOCK) failed");
    return 1;
    }
    fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((fl & O_NONBLOCK) == 0)
    {
    puts ("eventfd2(EFD_NONBLOCK) does not set non-blocking mode");
    return 1;
    }
    close (fd);

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper
     
  • This patch adds support for the SFD_NONBLOCK flag to signalfd4. The
    additional changes needed are minimal.

    The following test must be adjusted for architectures other than x86 and
    x86-64 and in case the syscall numbers changed.

    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    #include
    #include
    #include
    #include
    #include

    #ifndef __NR_signalfd4
    # ifdef __x86_64__
    # define __NR_signalfd4 289
    # elif defined __i386__
    # define __NR_signalfd4 327
    # else
    # error "need __NR_signalfd4"
    # endif
    #endif

    #define SFD_NONBLOCK O_NONBLOCK

    int
    main (void)
    {
    sigset_t ss;
    sigemptyset (&ss);
    sigaddset (&ss, SIGUSR1);
    int fd = syscall (__NR_signalfd4, -1, &ss, 8, 0);
    if (fd == -1)
    {
    puts ("signalfd4(0) failed");
    return 1;
    }
    int fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if (fl & O_NONBLOCK)
    {
    puts ("signalfd4(0) set non-blocking mode");
    return 1;
    }
    close (fd);

    fd = syscall (__NR_signalfd4, -1, &ss, 8, SFD_NONBLOCK);
    if (fd == -1)
    {
    puts ("signalfd4(SFD_NONBLOCK) failed");
    return 1;
    }
    fl = fcntl (fd, F_GETFL);
    if (fl == -1)
    {
    puts ("fcntl failed");
    return 1;
    }
    if ((fl & O_NONBLOCK) == 0)
    {
    puts ("signalfd4(SFD_NONBLOCK) does not set non-blocking mode");
    return 1;
    }
    close (fd);

    puts ("OK");

    return 0;
    }
    ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

    Signed-off-by: Ulrich Drepper
    Acked-by: Davide Libenzi
    Cc: Michael Kerrisk
    Signed-off-by: Andrew Morton
    Signed-off-by: Linus Torvalds

    Ulrich Drepper