linux

Author	SHA1	Message	Date
Colin Ian King	4defb5f912	orangefs: fix memory leak of string 'new' on exit path The allocated string 'new' is not freed on the exit path when cdm_element_count <= 0. Fix this by kfree'ing it. Fixes CoverityScan CID#1375923 "Resource Leak" Signed-off-by: Colin Ian King <colin.king@canonical.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-12-12 11:43:25 -05:00
Miklos Szeredi	dfeef68862	vfs: remove ".readlink = generic_readlink" assignments .readlink == NULL implies generic_readlink(). Generated by: to_del="\.readlink.=.generic_readlink" for i in `git grep -l $to_del`; do sed -i "/$to_del"/d $i; done Signed-off-by: Miklos Szeredi <mszeredi@redhat.com>	2016-12-09 16:45:04 +01:00
Al Viro	cbbd26b8b1	[iov_iter] new primitives - copy_from_iter_full() and friends copy_from_iter_full(), copy_from_iter_full_nocache() and csum_and_copy_from_iter_full() - counterparts of copy_from_iter() et al., advancing iterator only in case of successful full copy and returning whether it had been successful or not. Convert some obvious users. NOTE - do not blindly assume that something is a good candidate for those unless you are sure that not advancing iov_iter in failure case is the right thing in this case. Anything that does short read/short write kind of stuff (or is in a loop, etc.) is unlikely to be a good one. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-12-05 14:33:36 -05:00
Al Viro	450630975d	don't open-code file_inode() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-12-04 18:29:28 -05:00
Mike Marshall	19ff7fcc76	orangefs: add .owner to debugfs file_operations Without ".owner = THIS_MODULE" it is possible to crash the kernel by unloading the Orangefs module while someone is reading debugfs files. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-11-16 11:52:19 -05:00
Mike Marshall	dc0336214e	orangefs: clean up debugfs We recently refactored the Orangefs debugfs code. The refactor seemed to trigger dan.carpenter@oracle.com's static tester to find a possible double-free in the code. While designing the fix we saw a condition under which the buffer being freed could also be overflowed. We also realized how to rebuild the related debugfs file's "contents" (a string) without deleting and re-creating the file. This fix should eliminate the possible double-free, the potential overflow and improve code readability. Signed-off-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-11-07 10:41:55 -05:00
Christoph Hellwig	2f8b544477	block,fs: untangle fs.h and blk_types.h Nothing in fs.h should require blk_types.h to be included. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jens Axboe <axboe@fb.com>	2016-11-01 09:43:26 -06:00
Miklos Szeredi	804b1737d7	orangefs: don't use d_time Instead use d_fsdata which is the same size. Hoping to get rid of d_time, which is used by very few filesystems by this time. Signed-off-by: Miklos Szeredi <mszeredi@redhat.com> Reviewed-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-10-24 14:50:07 -04:00
Amir Goldstein	d62a9025ae	orangefs: use file_inode() where it is due Replace wrong use of file->f_path.dentry->d_inode with file_inode(file). In case orangefs ever finds itself as an overlayfs layer, it would want to get its own inode and not overlayfs's inode. DISCLAIMER: I did not test this patch because I do not know how to set up an orangefs mount Signed-off-by: Amir Goldstein <amir73il@gmail.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-10-24 14:29:39 -04:00
Linus Torvalds	101105b171	Merge branch 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs Pull more vfs updates from Al Viro: "->rename2() work from Miklos + current_time() from Deepa" * 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: fs: Replace current_fs_time() with current_time() fs: Replace CURRENT_TIME_SEC with current_time() for inode timestamps fs: Replace CURRENT_TIME with current_time() for inode timestamps fs: proc: Delete inode time initializations in proc_alloc_inode() vfs: Add current_time() api vfs: add note about i_op->rename changes to porting fs: rename "rename2" i_op to "rename" vfs: remove unused i_op->rename fs: make remaining filesystems use .rename2 libfs: support RENAME_NOREPLACE in simple_rename() fs: support RENAME_NOREPLACE for local filesystems ncpfs: fix unused variable warning	2016-10-10 20:16:43 -07:00
Al Viro	3873691e5a	Merge remote-tracking branch 'ovl/rename2' into for-linus	2016-10-10 23:02:51 -04:00
Linus Torvalds	97d2116708	Merge branch 'work.xattr' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs Pull vfs xattr updates from Al Viro: "xattr stuff from Andreas This completes the switch to xattr_handler ->get()/->set() from ->getxattr/->setxattr/->removexattr" * 'work.xattr' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: vfs: Remove {get,set,remove}xattr inode operations xattr: Stop calling {get,set,remove}xattr inode operations vfs: Check for the IOP_XATTR flag in listxattr xattr: Add __vfs_{get,set,remove}xattr helpers libfs: Use IOP_XATTR flag for empty directory handling vfs: Use IOP_XATTR flag for bad-inode handling vfs: Add IOP_XATTR inode operations flag vfs: Move xattr_resolve_name to the front of fs/xattr.c ecryptfs: Switch to generic xattr handlers sockfs: Get rid of getxattr iop sockfs: getxattr: Fail with -EOPNOTSUPP for invalid attribute names kernfs: Switch to generic xattr handlers hfs: Switch to generic xattr handlers jffs2: Remove jffs2_{get,set,remove}xattr macros xattr: Remove unnecessary NULL attribute name check	2016-10-10 17:11:50 -07:00
Linus Torvalds	abb5a14fa2	Merge branch 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs Pull misc vfs updates from Al Viro: "Assorted misc bits and pieces. There are several single-topic branches left after this (rename2 series from Miklos, current_time series from Deepa Dinamani, xattr series from Andreas, uaccess stuff from me) and I'd prefer to send those separately" * 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (39 commits) proc: switch auxv to use of __mem_open() hpfs: support FIEMAP cifs: get rid of unused arguments of CIFSSMBWrite() posix_acl: uapi header split posix_acl: xattr representation cleanups fs/aio.c: eliminate redundant loads in put_aio_ring_file fs/internal.h: add const to ns_dentry_operations declaration compat: remove compat_printk() fs/buffer.c: make __getblk_slow() static proc: unsigned file descriptors fs/file: more unsigned file descriptors fs: compat: remove redundant check of nr_segs cachefiles: Fix attempt to read i_blocks after deleting file [ver #2] cifs: don't use memcpy() to copy struct iov_iter get rid of separate multipage fault-in primitives fs: Avoid premature clearing of capabilities fs: Give dentry to inode_change_ok() instead of inode fuse: Propagate dentry down to inode_change_ok() ceph: Propagate dentry down to inode_change_ok() xfs: Propagate dentry down to inode_change_ok() ...	2016-10-10 13:04:49 -07:00
Al Viro	e55f1d1d13	Merge remote-tracking branch 'jk/vfs' into work.misc	2016-10-08 11:06:08 -04:00
Andreas Gruenbacher	fd50ecaddf	vfs: Remove {get,set,remove}xattr inode operations These inode operations are no longer used; remove them. Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-10-07 21:48:36 -04:00
Andreas Gruenbacher	6c6ef9f26e	xattr: Stop calling {get,set,remove}xattr inode operations All filesystems that support xattrs by now do so via xattr handlers. They all define sb->s_xattr, and their getxattr, setxattr, and removexattr inode operations use the generic inode operations. On filesystems that don't support xattrs, the xattr inode operations are all NULL, and sb->s_xattr is also NULL. This means that we can remove the getxattr, setxattr, and removexattr inode operations and directly call the generic handlers, or better, inline expand those handlers into fs/xattr.c. Filesystems that do not support xattrs on some inodes should clear the IOP_XATTR i_opflags flag in those inodes. (Right now, some filesystems have checks to disable xattrs on some inodes in the ->list, ->get, and ->set xattr handler operations instead.) The IOP_XATTR flag is automatically cleared in inodes of filesystems that don't have xattr support. In orangefs, symlinks do have a setxattr iop but no getxattr iop. Add a check for symlinks to orangefs_inode_getxattr to preserve the current, weird behavior; that check may not be necessary though. Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-10-07 20:10:44 -04:00
Mike Marshall	f60fbdbf41	Revert "orangefs: bump minimum userspace version" The features op did make it into OrangeFS 2.9.6 after all. This reverts commit `0c95ad7636`.	2016-10-03 15:07:36 -04:00
Martin Brandenburg	b78b11985a	Merge branch 'misc' into for-next Pull in an OrangeFS branch containing miscellaneous improvements. - clean up debugfs globals - remove dead code in sysfs - reorganize duplicated sysfs attribute structs - consolidate sysfs show and store functions - remove duplicated sysfs_ops structures - describe organization of sysfs - make devreq_mutex static - g_orangefs_stats -> orangefs_stats for consistency - rename most remaining global variables	2016-09-28 14:50:46 -04:00
Deepa Dinamani	c2050a454c	fs: Replace current_fs_time() with current_time() current_fs_time() uses struct super_block* as an argument. As per Linus's suggestion, this is changed to take struct inode* as a parameter instead. This is because the function is primarily meant for vfs inode timestamps. Also the function was renamed as per Arnd's suggestion. Change all calls to current_fs_time() to use the new current_time() function instead. current_fs_time() will be deleted. Signed-off-by: Deepa Dinamani <deepa.kernel@gmail.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-09-27 21:06:22 -04:00
Deepa Dinamani	078cd8279e	fs: Replace CURRENT_TIME with current_time() for inode timestamps CURRENT_TIME macro is not appropriate for filesystems as it doesn't use the right granularity for filesystem timestamps. Use current_time() instead. CURRENT_TIME is also not y2038 safe. This is also in preparation for the patch that transitions vfs timestamps to use 64 bit time and hence make them y2038 safe. As part of the effort current_time() will be extended to do range checks. Hence, it is necessary for all file system timestamps to use current_time(). Also, current_time() will be transitioned along with vfs to be y2038 safe. Note that whenever a single call to current_time() is used to change timestamps in different inodes, it is because they share the same time granularity. Signed-off-by: Deepa Dinamani <deepa.kernel@gmail.com> Reviewed-by: Arnd Bergmann <arnd@arndb.de> Acked-by: Felipe Balbi <balbi@kernel.org> Acked-by: Steven Whitehouse <swhiteho@redhat.com> Acked-by: Ryusuke Konishi <konishi.ryusuke@lab.ntt.co.jp> Acked-by: David Sterba <dsterba@suse.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-09-27 21:06:21 -04:00
Miklos Szeredi	2773bf00ae	fs: rename "rename2" i_op to "rename" Generated patch: sed -i "s/\.rename2\t/\.rename\t\t/" `git grep -wl rename2` sed -i "s/\brename2\b/rename/g" `git grep -wl rename2` Signed-off-by: Miklos Szeredi <mszeredi@redhat.com>	2016-09-27 11:03:58 +02:00
Miklos Szeredi	1cd66c93ba	fs: make remaining filesystems use .rename2 This is trivial to do: - add flags argument to foo_rename() - check if flags is zero - assign foo_rename() to .rename2 instead of .rename This doesn't mean it's impossible to support RENAME_NOREPLACE for these filesystems, but it is not trivial, like for local filesystems. RENAME_NOREPLACE must guarantee atomicity (i.e. it shouldn't be possible for a file to be created on one host while it is overwritten by rename on another host). Filesystems converted: 9p, afs, ceph, coda, ecryptfs, kernfs, lustre, ncpfs, nfs, ocfs2, orangefs. After this, we can get rid of the duplicate interfaces for rename. Signed-off-by: Miklos Szeredi <mszeredi@redhat.com> Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Acked-by: David Howells <dhowells@redhat.com> [AFS] Acked-by: Mike Marshall <hubcap@omnibond.com> Cc: Eric Van Hensbergen <ericvh@gmail.com> Cc: Ilya Dryomov <idryomov@gmail.com> Cc: Jan Harkes <jaharkes@cs.cmu.edu> Cc: Tyler Hicks <tyhicks@canonical.com> Cc: Oleg Drokin <oleg.drokin@intel.com> Cc: Trond Myklebust <trond.myklebust@primarydata.com> Cc: Mark Fasheh <mfasheh@suse.com>	2016-09-27 11:03:58 +02:00
Jan Kara	31051c85b5	fs: Give dentry to inode_change_ok() instead of inode inode_change_ok() will be responsible for clearing capabilities and IMA extended attributes and as such will need dentry. Give it as an argument to inode_change_ok() instead of an inode. Also rename inode_change_ok() to setattr_prepare() to better reflect that it also does some modifications in addition to checks. Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Jan Kara <jack@suse.cz>	2016-09-22 10:56:19 +02:00
Jan Kara	073931017b	posix_acl: Clear SGID bit when setting file permissions When file permissions are modified via chmod(2) and the user is not in the owning group or capable of CAP_FSETID, the setgid bit is cleared in inode_change_ok(). Setting a POSIX ACL via setxattr(2) sets the file permissions as well as the new ACL, but doesn't clear the setgid bit in a similar way; this allows bypassing the check in chmod(2). Fix that. References: CVE-2016-7097 Reviewed-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Jeff Layton <jlayton@redhat.com> Signed-off-by: Jan Kara <jack@suse.cz> Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com>	2016-09-22 10:55:32 +02:00
Martin Brandenburg	0c95ad7636	orangefs: bump minimum userspace version OrangeFS 2.9.6 was released without support for the features op. Thus OrangeFS 2.9.7 will be required to use it. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-09-21 12:37:23 -04:00
Martin Brandenburg	1d50361788	orangefs: rename most remaining global variables Only op_timeout_secs, slot_timeout_secs, and hash_table_size are left because they are exposed as module parameters. All other global variables have the orangefs_ prefix. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-16 11:41:24 -04:00
Martin Brandenburg	889d5f1bac	orangefs: g_orangefs_stats -> orangefs_stats for consistency Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 15:33:42 -04:00
Martin Brandenburg	a0fe051592	orangefs: make devreq_mutex static Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 15:21:16 -04:00
Martin Brandenburg	c27889cdb4	orangefs: describe organization of sysfs Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 15:11:32 -04:00
Martin Brandenburg	4a3436647a	orangefs: remove duplicated sysfs_ops structures Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 15:01:30 -04:00
Martin Brandenburg	7b0cae60ff	orangefs: consolidate sysfs show and store functions Remove a good bit of obfuscated and duplicated code. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 14:51:31 -04:00
Martin Brandenburg	2e9f80da07	orangefs: reorganize duplicated sysfs attribute structs We had a separate struct type for each type of attribute, but they all did the exact same thing. Consolidate them into one struct orangefs_attribute type. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 14:02:39 -04:00
Martin Brandenburg	dc3012a7af	orangefs: remove dead code in sysfs We had a pageful of structures containing kobjects and variables to store sysfs entries. However only the kobjects were in use. Replace them with kobjects. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 13:28:51 -04:00
Martin Brandenburg	44f4641073	orangefs: clean up debugfs globals Mostly this is moving code into orangefs-debugfs.c so that globals turn into static globals. Then gossip_debug_mask is renamed orangefs_gossip_debug_mask but keeps global visibility, so it can be used from a macro. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-15 11:38:36 -04:00
Martin Brandenburg	c51e012942	orangefs: do not allow client readahead cache without feature bit Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-12 16:12:09 -04:00
Martin Brandenburg	482664ddba	orangefs: add features op This is a new userspace operation, which will be done if the client-core version is greater than or equal to 2.9.6. This will provide a way to implement optional features and to determine which features are supported by the client-core. If the client-core version is older than 2.9.6, no optional features are supported and the op will not be done. The intent is to allow protocol extensions without relying on the client-core's current behavior of ignoring what it doesn't understand. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-12 15:12:54 -04:00
Martin Brandenburg	f2ee3b7595	orangefs: record userspace version for feature compatibility The client reports its version to the kernel on startup. We already test that it is above the minimum version. Now we record it in a global variable so code elsewhere can consult it before making a request the client may not understand. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-09 16:25:51 -04:00
Martin Brandenburg	4d20a75677	orangefs: add readahead count and size to sysfs Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-08 15:12:29 -04:00
Martin Brandenburg	ed1e158777	orangefs: re-add flush_racache from out-of-tree Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-08 15:12:29 -04:00
Martin Brandenburg	680908e504	orangefs: turn param response value into union This will support an upcoming request where two related values need to be updated atomically. This was done without a union in the OrangeFS server source already. Since that will break the kernel protocol, it has been fixed there and done here in a way that does not break the kernel protocol. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-08 15:12:28 -04:00
Martin Brandenburg	a6dff80a96	orangefs: add missing param request ops Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-08 15:12:27 -04:00
Martin Brandenburg	6eaff8c777	orangefs: rename remaining bits of mmap readahead cache This has been dormant code for many years. Parts of it were removed from the OrangeFS kernel code when it went into mainline. These bits were missed. Now the readahead cache has been resurrected in the OrangeFS userspace portions. It was renamed there, since it doesn't really have anything to do with mmap specifically, so it will be renamed here. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-08 15:12:27 -04:00
Al Viro	f66debf1b3	orangefs: use %pd/%pD Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-08-07 23:38:47 -04:00
Martin Brandenburg	8bbb20a863	orangefs: Account for jiffies wraparound. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-02 15:39:13 -04:00
Martin Brandenburg	957ee43718	orangefs: Change default dcache and getattr timeout to 50 msec. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-02 15:38:47 -04:00
Martin Brandenburg	4cd8f31944	orangefs: Allow dcache and getattr cache time to be configured. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-02 15:38:46 -04:00
Martin Brandenburg	71680c18c8	orangefs: Cache getattr results. The userspace component attempts to do this, but this will prevent us from even needing to go into userspace to satisfy certain getattr requests. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-02 15:38:45 -04:00
Martin Brandenburg	31b7c1ab4e	orangefs: Use d_time to avoid excessive lookups Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-08-02 15:38:21 -04:00
Linus Torvalds	6784725ab0	Merge branch 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs Pull vfs updates from Al Viro: "Assorted cleanups and fixes. Probably the most interesting part long-term is ->d_init() - that will have a bunch of followups in (at least) ceph and lustre, but we'll need to sort the barrier-related rules before it can get used for really non-trivial stuff. Another fun thing is the merge of ->d_iput() callers (dentry_iput() and dentry_unlink_inode()) and a bunch of ->d_compare() ones (all except the one in __d_lookup_lru())" * 'work.misc' of git://git.kernel.org/pub/scm/linux/kernel/git/viro/vfs: (26 commits) fs/dcache.c: avoid soft-lockup in dput() vfs: new d_init method vfs: Update lookup_dcache() comment bdev: get rid of ->bd_inodes Remove last traces of ->sync_page new helper: d_same_name() dentry_cmp(): use lockless_dereference() instead of smp_read_barrier_depends() vfs: clean up documentation vfs: document ->d_real() vfs: merge .d_select_inode() into .d_real() unify dentry_iput() and dentry_unlink_inode() binfmt_misc: ->s_root is not going anywhere drop redundant ->owner initializations ufs: get rid of redundant checks orangefs: constify inode_operations missed comment updates from ->direct_IO() prototype change file_inode(f)->i_mapping is f->f_mapping trim fsnotify hooks a bit 9p: new helper - v9fs_parent_fid() debugfs: ->d_parent is never NULL or negative ...	2016-07-28 12:59:05 -07:00
Linus Torvalds	0e06f5c0de	Merge branch 'akpm' (patches from Andrew) Merge updates from Andrew Morton: - a few misc bits - ocfs2 - most(?) of MM * emailed patches from Andrew Morton <akpm@linux-foundation.org>: (125 commits) thp: fix comments of __pmd_trans_huge_lock() cgroup: remove unnecessary 0 check from css_from_id() cgroup: fix idr leak for the first cgroup root mm: memcontrol: fix documentation for compound parameter mm: memcontrol: remove BUG_ON in uncharge_list mm: fix build warnings in <linux/compaction.h> mm, thp: convert from optimistic swapin collapsing to conservative mm, thp: fix comment inconsistency for swapin readahead functions thp: update Documentation/{vm/transhuge,filesystems/proc}.txt shmem: split huge pages beyond i_size under memory pressure thp: introduce CONFIG_TRANSPARENT_HUGE_PAGECACHE khugepaged: add support of collapse for tmpfs/shmem pages shmem: make shmem_inode_info::lock irq-safe khugepaged: move up_read(mmap_sem) out of khugepaged_alloc_page() thp: extract khugepaged from mm/huge_memory.c shmem, thp: respect MADV_{NO,}HUGEPAGE for file mappings shmem: add huge pages support shmem: get_unmapped_area align huge page shmem: prepare huge= mount option and sysfs knob mm, rmap: account shmem thp pages ...	2016-07-26 19:55:54 -07:00
Michal Hocko	8a5c743e30	mm, memcg: use consistent gfp flags during readahead Vladimir has noticed that we might declare memcg oom even during readahead because read_pages only uses GFP_KERNEL (with mapping_gfp restriction) while __do_page_cache_readahead uses page_cache_alloc_readahead which adds __GFP_NORETRY to prevent from OOMs. This gfp mask discrepancy is really unfortunate and easily fixable. Drop page_cache_alloc_readahead() which only has one user and outsource the gfp_mask logic into readahead_gfp_mask and propagate this mask from __do_page_cache_readahead down to read_pages. This alone would have only very limited impact as most filesystems are implementing ->readpages and the common implementation mpage_readpages does GFP_KERNEL (with mapping_gfp restriction) again. We can tell it to use readahead_gfp_mask instead as this function is called only during readahead as well. The same applies to read_cache_pages. ext4 has its own ext4_mpage_readpages but the path which has pages != NULL can use the same gfp mask. Btrfs, cifs, f2fs and orangefs are doing a very similar pattern to mpage_readpages so the same can be applied to them as well. [akpm@linux-foundation.org: coding-style fixes] [mhocko@suse.com: restrict gfp mask in mpage_alloc] Link: http://lkml.kernel.org/r/20160610074223.GC32285@dhcp22.suse.cz Link: http://lkml.kernel.org/r/1465301556-26431-1-git-send-email-mhocko@kernel.org Signed-off-by: Michal Hocko <mhocko@suse.com> Cc: Vladimir Davydov <vdavydov@parallels.com> Cc: Chris Mason <clm@fb.com> Cc: Steve French <sfrench@samba.org> Cc: Theodore Ts'o <tytso@mit.edu> Cc: Jan Kara <jack@suse.cz> Cc: Mike Marshall <hubcap@omnibond.com> Cc: Jaegeuk Kim <jaegeuk@kernel.org> Cc: Changman Lee <cm224.lee@samsung.com> Cc: Chao Yu <yuchao0@huawei.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2016-07-26 16:19:19 -07:00
Jann Horn	78fee0b684	orangefs: fix namespace handling In orangefs_inode_getxattr(), an fsuid is written to dmesg. The kuid is converted to a userspace uid via from_kuid(current_user_ns(), [...]), but since dmesg is global, init_user_ns should be used here instead. In copy_attributes_from_inode(), op_alloc() and fill_default_sys_attrs(), upcall structures are populated with uids/gids that have been mapped into the caller's namespace. However, those upcall structures are read by another process (the userspace filesystem driver), and that process might be running in another namespace. This effectively lets any user spoof its uid and gid as seen by the userspace filesystem driver. To fix the second issue, I just construct the opcall structures with init_user_ns uids/gids and require the filesystem server to run in the init namespace. Since orangefs is full of global state anyway (as the error message in DUMP_DEVICE_ERROR explains, there can only be one userspace orangefs filesystem driver at once), that shouldn't be a problem. [ Why does orangefs even exist in the kernel if everything does upcalls into userspace? What does orangefs do that couldn't be done with the FUSE interface? If there is no good answer to those questions, I'd prefer to see orangefs kicked out of the kernel. Can that be done for something that shipped in a release? According to commit `f7ab093f74` ("Orangefs: kernel client part 1"), they even already have a FUSE daemon, and the only rational reason (apart from "but most of our users report preferring to use our kernel module instead") given for not wanting to use FUSE is one "in-the-works" feature that could probably be integrated into FUSE instead. ] This patch has been compile-tested. Signed-off-by: Jann Horn <jannh@google.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-07-05 15:47:43 -04:00
Mike Marshall	3903f15008	Orangefs: allow O_DIRECT in open Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-07-05 15:47:35 -04:00
Andreas Gruenbacher	d373a712c1	orangefs: Remove useless xattr prefix arguments Mike, On Fri, Jun 3, 2016 at 9:44 PM, Mike Marshall <hubcap@omnibond.com> wrote: > We use the return value in this one line you changed, our userspace code gets > ill when we send it (-ENOMEM +1) as a key length... ah, my mistake. Here's a fixed version. Thanks, Andreas Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-07-05 15:47:27 -04:00
Andreas Gruenbacher	2ce8272a10	orangefs: Remove redundant "trusted." xattr handler Orangefs has a catch-all xattr handler that effectively does what the trusted handler does already. Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-07-05 15:47:22 -04:00
Andreas Gruenbacher	972a7344fc	orangefs: Remove useless defines The ORANGEFS_XATTR_INDEX_ defines are unused; the ORANGEFS_XATTR_NAME_ defines only obfuscate the code. Signed-off-by: Andreas Gruenbacher <agruenba@redhat.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-07-05 15:47:16 -04:00
Al Viro	6f3fc1070b	orangefs: constify inode_operations Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-05-29 19:07:00 -04:00
Al Viro	96b0cffbaf	orangefs: don't open-code %pd2 Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-05-29 16:22:07 -04:00
Al Viro	5930122683	switch xattr_handler->set() to passing dentry and inode separately preparation for similar switch in ->setxattr() (see the next commit for rationale). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-05-27 15:39:43 -04:00
Al Viro	5ecfcb265f	orangefs: don't open-code inode_lock/inode_unlock Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-05-02 19:47:23 -04:00
Al Viro	84695ffee7	Merge getxattr prototype change into work.lookups The rest of work.xattr stuff isn't needed for this branch	2016-05-02 19:45:47 -04:00
Al Viro	b296821a7c	xattr_handler: pass dentry and inode as separate arguments of ->get() ... and do not assume they are already attached to each other Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	2016-04-10 20:48:24 -04:00
Linus Torvalds	6759212640	Orangefs: cleanups and a strncpy vulnerability fix. Cleanups: - remove an unused variable from orangefs_readdir. - clean up printk wrapper used for ofs "gossip" debugging. - clean up truncate ctime and mtime setting in inode.c - remove a useless null check found by coccinelle. - optimize some memcpy/memset boilerplate code. - remove some useless sanity checks from xattr.c Fix: - fix a potential strncpy vulnerability. Merge tag 'for-linus-4.6-ofs1' of git://git.kernel.org/pub/scm/linux/kernel/git/hubcap/linux Pull orangefs fixes from Mike Marshall: "Orangefs cleanups and a strncpy vulnerability fix. Cleanups: - remove an unused variable from orangefs_readdir. - clean up printk wrapper used for ofs "gossip" debugging. - clean up truncate ctime and mtime setting in inode.c - remove a useless null check found by coccinelle. - optimize some memcpy/memset boilerplate code. - remove some useless sanity checks from xattr.c Fix: - fix a potential strncpy vulnerability" * tag 'for-linus-4.6-ofs1' of git://git.kernel.org/pub/scm/linux/kernel/git/hubcap/linux: orangefs: remove unused variable orangefs: Add KERN_<LEVEL> to gossip_<level> macros orangefs: strncpy -> strscpy orangefs: clean up truncate ctime and mtime setting Orangefs: fix ifnullfree.cocci warnings Orangefs: optimize boilerplate code. Orangefs: xattr.c cleanup	2016-04-09 10:33:58 -07:00
Martin Brandenburg	e56f498142	orangefs: remove unused variable Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 15:50:44 -04:00
Joe Perches	1917a69328	orangefs: Add KERN_<LEVEL> to gossip_<level> macros Emit the logging messages at the appropriate levels. Miscellanea: o Change format to fmt o Use the more common ##__VA_ARGS__ Signed-off-by: Joe Perches <joe@perches.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:10:45 -04:00
Martin Brandenburg	2eacea74cc	orangefs: strncpy -> strscpy It would have been possible for a rogue client-core to send in a symlink target which is not NUL terminated. This returns EIO if the client-core gives us corrupt data. Leave debugfs and superblock code as is for now. Other dcache.c and namei.c strncpy instances are safe because ORANGEFS_NAME_MAX = NAME_MAX + 1; there is always enough space for a name plus a NUL byte. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:10:34 -04:00
Martin Brandenburg	f83140c146	orangefs: clean up truncate ctime and mtime setting The ctime and mtime are always updated on a successful ftruncate and only updated on a successful truncate where the size changed. We handle the ``if the size changed'' bit. This matches FUSE's behavior. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:10:31 -04:00
kbuild test robot	2fa37fd713	Orangefs: fix ifnullfree.cocci warnings fs/orangefs/orangefs-debugfs.c:130:2-26: WARNING: NULL check before freeing functions like kfree, debugfs_remove, debugfs_remove_recursive or usb_free_urb is not needed. Maybe consider reorganizing relevant code to avoid passing NULL values. NULL check before some freeing functions is not needed. Based on checkpatch warning "kfree(NULL) is safe this check is probably not required" and kfreeaddr.cocci by Julia Lawall. Generated by: scripts/coccinelle/free/ifnullfree.cocci Signed-off-by: Fengguang Wu <fengguang.wu@intel.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:08:38 -04:00
Mike Marshall	a9bb3ba81f	Orangefs: optimize boilerplate code. Suggested by David Binderman <dcb314@hotmail.com> The former can potentially be a performance win over the latter. memcpy(d, s, len); memset(d+len, c, size-len); memset(d, c, size); memcpy(d, s, len); Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:08:27 -04:00
Mike Marshall	2d09a2ca6a	Orangefs: xattr.c cleanup 1. It is nonsense to test for negative size_t, suggested by David Binderman <dcb314@hotmail.com> 2. By the time Orangefs gets called, the vfs has ensured that name != NULL, and that buffer and size are sane. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-04-08 14:08:27 -04:00
Linus Torvalds	4a2d057e4f	Merge branch 'PAGE_CACHE_SIZE-removal' Merge PAGE_CACHE_SIZE removal patches from Kirill Shutemov: "PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced long time ago with promise that one day it will be possible to implement page cache with bigger chunks than PAGE_SIZE. This promise never materialized. And unlikely will. Let's stop pretending that pages in page cache are special. They are not. The first patch with most changes has been done with coccinelle. The second is manual fixups on top. The third patch removes macros definition" [ I was planning to apply this just before rc2, but then I spaced out, so here it is right _after_ rc2 instead. As Kirill suggested as a possibility, I could have decided to only merge the first two patches, and leave the old interfaces for compatibility, but I'd rather get it all done and any out-of-tree modules and patches can trivially do the conversion while still also working with older kernels, so there is little reason to try to maintain the redundant legacy model. - Linus ] * PAGE_CACHE_SIZE-removal: mm: drop PAGE_CACHE_* and page_cache_{get,release} definition mm, fs: remove remaining PAGE_CACHE_* and page_cache_{get,release} usage mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros	2016-04-04 10:50:24 -07:00
Kirill A. Shutemov	09cbfeaf1a	mm, fs: get rid of PAGE_CACHE_* and page_cache_{get,release} macros PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} macros were introduced long time ago with promise that one day it will be possible to implement page cache with bigger chunks than PAGE_SIZE. This promise never materialized. And unlikely will. We have many places where PAGE_CACHE_SIZE assumed to be equal to PAGE_SIZE. And it's constant source of confusion on whether PAGE_CACHE_* or PAGE_* constant should be used in a particular case, especially on the border between fs and mm. Global switching to PAGE_CACHE_SIZE != PAGE_SIZE would cause too much breakage to be doable. Let's stop pretending that pages in page cache are special. They are not. The changes are pretty straight-forward: - <foo> << (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>; - <foo> >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) -> <foo>; - PAGE_CACHE_{SIZE,SHIFT,MASK,ALIGN} -> PAGE_{SIZE,SHIFT,MASK,ALIGN}; - page_cache_get() -> get_page(); - page_cache_release() -> put_page(); This patch contains automated changes generated with coccinelle using script below. For some reason, coccinelle doesn't patch header files. I've called spatch for them manually. The only adjustment after coccinelle is revert of changes to PAGE_CACHE_ALIGN definition: we are going to drop it later. There are few places in the code where coccinelle didn't reach. I'll fix them manually in a separate patch. Comments and documentation also will be addressed with the separate patch. virtual patch @@ expression E; @@ - E << (PAGE_CACHE_SHIFT - PAGE_SHIFT) + E @@ expression E; @@ - E >> (PAGE_CACHE_SHIFT - PAGE_SHIFT) + E @@ @@ - PAGE_CACHE_SHIFT + PAGE_SHIFT @@ @@ - PAGE_CACHE_SIZE + PAGE_SIZE @@ @@ - PAGE_CACHE_MASK + PAGE_MASK @@ expression E; @@ - PAGE_CACHE_ALIGN(E) + PAGE_ALIGN(E) @@ expression E; @@ - page_cache_get(E) + get_page(E) @@ expression E; @@ - page_cache_release(E) + put_page(E) Signed-off-by: Kirill A. 
Shutemov <kirill.shutemov@linux.intel.com> Acked-by: Michal Hocko <mhocko@suse.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2016-04-04 10:41:08 -07:00
Martin Brandenburg	878dfd3210	orangefs: minimum userspace version is 2.9.3 Version 2.9.4 isn't even released yet. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-03-31 12:06:00 -04:00
Martin Brandenburg	641bb3246d	orangefs: don't put readdir slot twice This was quite an oversight. After a readdir, the module could not be unloaded, the number of slots is wrong, and memory near the slot bitmap is possibly corrupt. Oops. Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2016-03-31 12:06:00 -04:00
Al Viro	45996492e5	orangefs: fix orangefs_superblock locking * switch orangefs_remount() to taking ORANGEFS_SB(sb) instead of sb * remove from the list _before_ orangefs_unmount() - request_mutex in the latter will make sure that nothing observed in the loop in ORANGEFS_DEV_REMOUNT_ALL handling will get freed until the end of loop * on removal, keep the forward pointer and zero the back one. That way we can drop and regain the spinlock in the loop body (again, ORANGEFS_DEV_REMOUNT_ALL one) and still be able to get to the rest of the list. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-26 07:22:00 -04:00
Al Viro	6d4c1a30b3	orangefs: fix do_readv_writev() handling of error halfway through Error should only be returned if nothing had been read/written. Otherwise we need to report a short read/write instead. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	524b1d3095	orangefs: have ->kill_sb() evict the VFS side of things first Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	177f8fc491	orangefs: sanitize ->llseek() a) open files can't have NULL inodes b) it's SEEK_END, not ORANGEFS_SEEK_END; no need to get cute. c) make_bad_inode() on lseek()? Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	7df240d771	orangefs-bufmap.h: trim unused junk Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	b8a99a8f9f	orangefs: saner calling conventions for getting a slot just have it return the slot number or -E... - the caller checks the sign anyway Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	bf6bf606e5	orangefs_copy_{to,from}_bufmap(): don't pass bufmap pointer it's always __orangefs_bufmap Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Al Viro	9f5e2f7f1b	orangefs: get rid of readdir_handle_s no point, really - we couldn't keep those across the calls of getdents(); it would be too easy to DoS, having all slots exhausted. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-25 22:30:54 -04:00
Martin Brandenburg	fecd86aac5	orangefs: ensure that truncate has an up to date inode size Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:16 -04:00
Martin Brandenburg	e8da254c41	orangefs: move code which sets i_link to orangefs_inode_getattr Everything else setting inode->i_ values is in there. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:16 -04:00
Martin Brandenburg	05d31c5cb3	orangefs: remove needless wrapper around GFP_KERNEL Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:15 -04:00
Martin Brandenburg	93d53a4885	orangefs: remove wrapper around mutex_lock(&inode->i_mutex) Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:15 -04:00
Martin Brandenburg	266626339b	orangefs: refactor inode type or link_target change detection Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:15 -04:00
Martin Brandenburg	5859d77e56	orangefs: use new getattr for revalidate and remove old getattr Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:15 -04:00
Martin Brandenburg	8f24928d19	orangefs: use new getattr in inode getattr and permission Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:15 -04:00
Martin Brandenburg	e2f7f0d798	orangefs: use new orangefs_inode_getattr to get size in write and llseek Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:14 -04:00
Martin Brandenburg	075cca50b6	orangefs: use new orangefs_inode_getattr to create new inodes Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:14 -04:00
Martin Brandenburg	3c9cf98d7b	orangefs: rename orangefs_inode_getattr to orangefs_inode_old_getattr This is motivated by orangefs_inode_old_getattr's habit of writing over live inodes. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:14 -04:00
Martin Brandenburg	d57521a653	orangefs: remove inode->i_lock wrapper Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-23 17:36:13 -04:00
Martin Brandenburg	2f83ace371	orangefs: put register_chrdev immediately before register_filesystem Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-17 14:34:10 -04:00
Martin Brandenburg	a4c680a027	orangefs: remove paranoia in orangefs_set_inode Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-17 14:33:56 -04:00
Martin Brandenburg	02a5cc537d	orangefs: sanitize listxattr and return EIO on impossible values Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-17 14:33:47 -04:00
Martin Brandenburg	5e06664f29	orangefs: remove unused reference to xattr key length Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-17 14:33:47 -04:00
Mike Marshall	1a0ce16d71	Orangefs: adjust unwind on module init failure. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-17 13:24:34 -04:00
Mike Marshall	2180c52cc7	Orangefs: fix sloppy cleanups of debugfs and sysfs init failures. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-14 15:48:28 -04:00
Mike Marshall	a7d3e78ab5	Orangefs: follow_link -> get_link change Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-14 15:48:28 -04:00
Mike Marshall	53f57fef43	Orangefs: Extra sanity insurance on buffer before using string functions on it. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-14 15:48:28 -04:00
Martin Brandenburg	acfcbaf192	orangefs: make fs_mount_pending static Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-09 13:26:39 -05:00
Martin Brandenburg	c62da5853d	orangefs: Avoid symlink upcall if target is too long. Previously the client-core detected this condition by sheer luck! Since we used strncpy, no NUL byte would be included on the name. The client-core would call strlen, which would read past the end of its buffer, but return a number large enough that the client-core would return ENAMETOOLONG. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-09 13:26:39 -05:00
Mike Marshall	162ada7764	Orangefs: improve the POSIXness of interrupted writes... Don't return EINTR on interrupted writes if some data has already been written. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-09 13:12:37 -05:00
Mike Marshall	cf07c0bf88	Orangefs: add a new gossip statement Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-09 13:11:45 -05:00
Mike Marshall	9d9e7ba9ee	Orangefs: improve gossip statements Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-03-03 13:46:48 -05:00
Mike Marshall	9f08cfe944	Orangefs: update orangefs.txt Al Viro has cleaned up the way ops are processed and waited for, now orangefs.txt has an overview of how it works. Several recent related commits have added to the comments in the code as well. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-26 14:39:08 -05:00
Mike Marshall	ca9f518ead	Orangefs: code sanitation. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-26 10:21:12 -05:00
Arnd Bergmann	401898eed7	orangefs: remove unused 'diff' function orangefs contains a helper function to calculate the difference between two timeval structures. We are trying to remove all instances of timespec from the kernel, and this one is not used at all, so let's remove it now. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-26 10:18:43 -05:00
Arnd Bergmann	be81ce48b2	orangefs: avoid time conversion function The new orangefs code uses a helper function to read a time field to its private structures from struct iattr. This will conflict with the move to 64-bit timestamps in the kernel and is generally not necessary. This replaces the conversion with a simple cast to time64_t that shows what is going on. As the orangefs-internal representation already uses 64-bit timestamps, there should be no ambiguity to negative values, and the cast ensures that we treat them as times before 1970 on both 32-bit and 64-bit architectures, rather than times after 2038. This patch keeps that behavior. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-26 10:18:39 -05:00
Martin Brandenburg	69a23de2f3	orangefs: clean up fill_default_sys_attrs Size and type are read-only and not in the mask. The times were left unset despite being in the mask. We zero-fill the times since the server will fill them in and we will get the correct time when we fill the inode with getattr. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:51 -05:00
Martin Brandenburg	6ceaf7818f	orangefs: we never lookup with sym_follow set Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:51 -05:00
Martin Brandenburg	9c2bcf288e	orangefs: remove vestigial async io code I have verified that there is nothing in the userspace daemon version we are implementing this protocol against that ever looks at this field. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:50 -05:00
Martin Brandenburg	47b4948fdb	orangefs: use ORANGEFS_NAME_LEN everywhere; remove ORANGEFS_NAME_MAX Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:50 -05:00
Martin Brandenburg	ee70fca0bc	orangefs: don't d_drop in d_revalidate since the caller will Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:50 -05:00
Martin Brandenburg	ee3b8d377c	orangefs: free readdir buffer index before the dir_emit loop We only need it while the service operation is actually in progress since it is only used to co-ordinate the client-core's memory use. The kernel allocates its own space. Also clean up some comments which mislead the reader into thinking the readdir buffers are shared memory. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 17:07:50 -05:00
Mike Marshall	adcf34a289	Orangefs: code sanitation Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 16:54:27 -05:00
Mike Marshall	d37c0f307a	Orangefs: clean up orangefs_kernel_op_s comments. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-24 13:24:14 -05:00
Al Viro	c1223ca48b	orangefs: get rid of op refcounts not needed anymore Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:56 -05:00
Al Viro	05a50a5be8	orangefs: have ..._clean_interrupted_...() wait for copy to/from daemon * turn all those list_del(&op->list) into list_del_init() * don't pick ops that are already given up in control device ->read()/->write_iter(). * have orangefs_clean_interrupted_operation() notice if op is currently being copied to/from daemon (by said ->read()/->write_iter()) and wait for that to finish. * when we are done copying to/from daemon and find that it had been given up while we were doing that, wake the waiting ..._clean_interrupted_... As the result, we are guaranteed that orangefs_clean_interrupted_operation(op) doesn't return until nobody else can see op. Moreover, we don't need to play with op refcounts anymore. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:56 -05:00
Al Viro	5964c1b839	orangefs: set correct ->downcall.status on failing to copy reply from daemon ... and clean the end of control device ->write_iter() while we are at it Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:55 -05:00
Mike Marshall	ddb84da38d	Orangefs: remove vestigial ASYNC code Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:55 -05:00
Mike Marshall	5253487e04	Orangefs: make some gossip statements more helpful. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:55 -05:00
Al Viro	897c5df6cf	orangefs: get rid of op->done shouldn't be needed now Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:55 -05:00
Al Viro	82d37f19ff	orangefs_readdir_index_put(): get rid of bufmap argument Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:54 -05:00
Al Viro	ea2c9c9f65	orangefs: bufmap rewrite new waiting-for-slot logics: * make request for slot wait for bufmap to be set up if it comes before it's installed OR while it's running down * make closing control device wait for all slots to be freed * waiting itself rewritten to (open-coded) analogues of wait_event_... primitives - we would need wait_event_locked() and, pardon an obscenely long name, wait_event_interruptible_exclusive_timeout_locked(). * we never wait for more than slot_timeout_secs in total and, if during the wait the daemon goes away, we only allow ORANGEFS_BUFMAP_WAIT_TIMEOUT_SECS for it to come back. * (cosmetical) bitmap is used instead of an array of zeroes and ones * old (and only reached if we are about to corrupt memory) waiting for daemon restart in service_operation() removed. [Martin's fixes folded] Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:54 -05:00
Al Viro	178041848a	orangefs_bufmap_..._query(): don't bother with refcounts ... just hold the spinlock while fetching the field in question. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:54 -05:00
Al Viro	05b39a8b5c	orangefs: lift handling of timeouts and attempts count to service_operation() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:54 -05:00
Al Viro	c72f15b7d9	service_operation(): don't block signals, just use ..._killable Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:53 -05:00
Al Viro	98815ade9e	orangefs: sanitize handling of request list * checking that daemon is running (to decide whether we want to limit the timeout) should be done after the damn thing is included into the list; doing that before means that if the daemon gets shut down in between, we'll end up waiting indefinitely (== up to kill -9). * cancels should go into the head of the queue - the sooner they are picked, the less work daemon has to do and the sooner we get to free the slot held by aborted operation. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:53 -05:00
Al Viro	d2d87a3b6d	orangefs: get rid of loop in wait_for_matching_downcall() turn op->waitq into struct completion... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:53 -05:00
Martin Brandenburg	cf22644a0e	orangefs: use S_ISREG(mode) and friends instead of mode & S_IFREG. Suggestion from Dan Carpenter. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:53 -05:00
Al Viro	78699e29fd	orangefs: delay freeing slot until cancel completes Make cancels reuse the aborted read/write op, to make sure they do not fail on lack of memory. Don't issue a cancel unless the daemon has seen our read/write, has not replied and isn't being shut down. If cancel is issued, don't wait for it to complete; stash the slot in there and just have it freed when cancel is finally replied to or purged (and delay dropping the reference until then, obviously). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-19 13:45:53 -05:00
Al Viro	1357d06d49	get rid of bufmap argument of orangefs_bufmap_put() it's always equal to __orangefs_bufmap and the latter can't change until we are done Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-12 15:05:33 -05:00
Al Viro	c0eae8cd77	orangefs: get rid of handle_io_error() the second caller never needs to cancel, actually Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-12 15:05:32 -05:00
Al Viro	7b9761af86	orangefs: wait_for_direct_io(): restore the position in iter when restarting Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-12 15:05:32 -05:00
Al Viro	e17be9fd4d	orangefs: avoid freeing a slot twice in wait_for_direct_io() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-12 15:05:32 -05:00
Mike Marshall	6ebcc3fcda	Orangefs: added a couple of WARN_ONs, perhaps just temporarily. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 16:28:31 -05:00
Martin Brandenburg	237f8282c0	orangefs: Do not retrieve size from servers unless it is necessary. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 14:12:43 -05:00
Martin Brandenburg	933287da75	orangefs: Implement inode_operations->permission(). Thus d_revalidate is not obliged to check on as much, which will eventually lead the way to hammering the filesystem servers much less. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 14:12:37 -05:00
Martin Brandenburg	fe88adc366	orangefs: Only compare attributes specified in orangefs_inode_getattr. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 14:12:17 -05:00
Mike Marshall	2d4cae0d17	Orangefs: clean up slab allocation. A couple of caches were no longer needed: - iov_iter improvements to orangefs_devreq_write_iter eliminated the need for the dev_req_cache. - removal (months ago) of the old AIO code eliminated the need for the kiocb_cache. Also, deobfuscation of use of GFP_KERNEL when calling kmem_cache_(z)alloc for remaining caches. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 13:48:16 -05:00
Mike Marshall	5090c9670d	Orangefs: improve gossip statement There were two just alike, making it hard maybe to tell which one you were looking at in syslog... so I changed it a little by adding some extra interesting tidbits to it... Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-02-04 13:29:27 -05:00
Martin Brandenburg	99109822f5	orangefs: Fix revalidate. Previously, it would update a live inode. This was fixed, but it did not ever check that the inode attributes in the dcache are correct. This checks all inode attributes and rejects any that are not correct, which causes a lookup and thus a new getattr. Perhaps inode_operations->permission should replace or augment some of this. There is no actual caching, and this does a rather excessive amount of network operations back to the filesystem server. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-28 15:08:40 -05:00
Martin Brandenburg	394f647e3a	orangefs: Util functions shouldn't operate on inode where it can be avoided. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-28 15:08:39 -05:00
Al Viro	115b93a859	orangefs: clean up op_alloc() fold orangefs_op_initialize() in there, don't bother locking something nobody else could've seen yet, use kmem_cache_zalloc() instead of explicit memset()... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:20:11 -05:00
Al Viro	b0bc3a7b62	orangefs: move handle_io_error() to file.c Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:20:11 -05:00
Al Viro	2a9e5c2260	orangefs: don't reinvent completion.h... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:20:11 -05:00
Al Viro	4f55e39732	if ORANGEFS_VFS_OP_FILE_IO request had been given up, don't bother waiting ... we are not going to get woken up anyway, so it's just going to time out and whine. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:20:11 -05:00
Al Viro	727cbfea62	orangefs: get rid of MSECS_TO_JIFFIES All timeouts are in _seconds_, so all calls are of form MSECS_TO_JIFFIES(n * 1000), which is a convoluted way to spell n * HZ. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:20:11 -05:00
Al Viro	eab9b38939	orangefs_clean_up_interrupted_operation: call with op->lock held Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:15:09 -05:00
Al Viro	70c6ea26ff	orangefs: reduce nesting in wait_for_matching_downcall() reorder if branches... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:15:09 -05:00
Al Viro	e1056a9cc3	orangefs: remove cargo-culting spin_lock_irqsave() in service_operation() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 15:15:09 -05:00
Al Viro	ed42fe0593	orangefs: hopefully saner op refcounting and locking * create with refcount 1 * make op_release() decrement and free if zero (i.e. old put_op() has become that). * mark when submitter has given up waiting; from that point nobody else can move between the lists, change state, etc. * have daemon read/write_iter grab a reference when picking op and always give it up in the end * don't put into hash until we know it's been successfully passed to daemon * move op->lock _lower_ than htab_in_progress_lock (and make sure to take it in purge_inprogress_ops()) Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 13:03:12 -05:00
Al Viro	fee25ce125	orangefs: make sure that reopening pvfs2-req won't overlap with the end of close Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:55:24 -05:00
Al Viro	96acf9d65e	orangefs: nothing should remain in request list and in hash ... otherwise some thread is running in .text that is about to be freed. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:55:24 -05:00
Al Viro	60831949cc	orangefs: move wakeups into set_op_state_{serviced,purged}() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:55:24 -05:00
Al Viro	ade3d78104	orangefs: make wait_for_...downcall() static Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:48:02 -05:00
Al Viro	831d094979	orangefs: move wakeups into set_op_state_{serviced,purged}() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	b7ae37b09e	orangefs: make wait_for_...downcall() static Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	e07db0a2c2	make orangefs_clean_up_interrupted_operation() static Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	1264ddfdb7	orangefs: kill orangefs_inode_s ->list no users... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	fc916da52d	orangefs: get rid of <censored> macros Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	90e54e36c9	orangefs: ->poll() doesn't need spinlock not just for list_empty()... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	8016387ce7	orangefs: kill ioctl32 rudiments Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	83595db052	orangefs: ->poll() is only called between successful ->open() and ->release() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	fb6d2526e9	orangefs: generic_file_open() is pointless for character devices Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Al Viro	3e1dd9aa82	orangefs: use DEFINE_MUTEX (and mutex_init() had been too late) Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-23 12:42:43 -05:00
Mike Marshall	cf0c27715b	Orangefs: make gossip statement more palatable to xtensa Thanks to Intel's kbuild test robot Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-19 12:04:40 -05:00
Mike Marshall	1808f8cc6c	Orangefs: add verification to decode_dirents Also add comments to decode_dirents and make it more readable. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-15 13:10:52 -05:00
Mike Marshall	be57366e14	Orangefs: make .statfs gossip_debug more complete. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-13 11:38:14 -05:00
Mike Marshall	569dbfc6b3	Orangefs: define a minimum compatible userspace version. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-13 11:36:25 -05:00
Mike Marshall	4c27b327b8	Orangefs: change ORANGEFS_VERSION from "Unknown" to "upstream" Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-13 11:34:59 -05:00
Mike Marshall	c817e266e4	Orangefs: rename orangefs_kernel_op_s.aio_ref_count to just ref_count. The op structure's ref_count member hasn't got anything to do with asynchronous I/O. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-13 11:29:05 -05:00
Mike Marshall	b3ae4755f5	Orangefs: implement .write_iter Until now, orangefs_devreq_write_iter has just been a wrapper for the old-fashioned orangefs_devreq_writev... linux would call .write_iter with "struct kiocb iocb" and "struct iov_iter iter" and .write_iter would just: return pvfs2_devreq_writev(iocb->ki_filp, iter->iov, iter->nr_segs, &iocb->ki_pos); Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-13 11:18:12 -05:00
Martin Brandenburg	7d2214858f	orangefs: Fix some more global namespace pollution. This only changes the names of things, so there is no functional change. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-04 16:21:46 -05:00
Richard Weinberger	c146c0b87f	orangefs: Don't pollute global namespace Prefix public functions with "orangefs_" do don't pollute the global namespace. This fixes a build issue on UML which also has block_signals(). Signed-off-by: Richard Weinberger <richard@nod.at> Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-04 16:21:37 -05:00
Julia Lawall	acaca36dd9	OrangeFS: constify export_operations structures This export_operations structure is never modified, so declare it as const. Most other structures of this type are already const. Done with the help of Coccinelle. Signed-off-by: Julia Lawall <Julia.Lawall@lip6.fr> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2016-01-04 11:06:12 -05:00
Mike Marshall	f987f4c28a	Orangefs: don't trigger copy_attributes_to_inode from d_revalidate. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-30 13:04:28 -05:00
Arnd Bergmann	eb57bcc271	orangefs: fix typo in ornagefs_inode_lock Orangefs fails to build on 32-bit SMP configurations due to a simple misspelling, this does the obvious fix. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Fixes: `575e946125` ("Orangefs: change pvfs2 filenames to orangefs") Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-28 10:31:21 -05:00
Nicholas Mc Guire	dde58ca436	Orangefs: use kzalloc for kmalloc + memset 0 This is an API consolidation only. The use of kmalloc + memset to 0 should be equivalent to kzalloc in this case. Signed-off-by: Nicholas Mc Guire <hofrat@osadl.org> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-28 10:30:51 -05:00
Mike Marshall	62441fa53b	Orangefs: validate resp.listxattr.returned_count Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-17 16:11:40 -05:00
Mike Marshall	fef8b67ce6	Orangefs: don't use deprecated xattr defines. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-17 14:36:34 -05:00
Martin Brandenburg	b09d10df5a	orangefs: Do not unref if there is no bufmap. Signed-off-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2015-12-17 14:35:45 -05:00
Martin Brandenburg	765a75b34a	orangefs: Remove useless inline qualifier from bufmap functions. All callers were outside of the file these functions were declared in, so nothing was ever inlined anyway. Further this happens before I/O and any speedup by not having to do a call will be dwarfed by the time it takes to talk to the server. Signed-off-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2015-12-17 14:34:53 -05:00
Martin Brandenburg	bf89f58432	orangefs: Change visibility of several bufmap helpers to static. Signed-off-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2015-12-17 14:34:17 -05:00
Martin Brandenburg	a762ae6dc5	orangefs: Remove ``aligned'' upcall and downcall length macros. There was previously MAX_ALIGNED_DEV_REQ_(UP\|DOWN)SIZE macros which evaluated to MAX_DEV_REQ_(UP\|DOWN)SIZE+8. As it is unclear what this is for, other than creating a situation where we accept more data than we can parse, it is removed. Signed-off-by: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Martin Brandenburg <martin@omnibond.com>	2015-12-17 14:33:38 -05:00
Martin Brandenburg	90d26aa808	Orangefs: do not finalize bufmap if it was never initialized. Found by the infant Orangefs fuzzer... Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-15 15:37:53 -05:00
Mike Marshall	ce6c414e17	Orangefs: Don't wait the old-fashioned way. Get rid of add_wait_queue, set_current_state, etc, and use the wait_event() model. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-14 14:54:46 -05:00
Mike Marshall	97f100277c	Orangefs: de-uglify orangefs_devreq_writev, and devorangefs-req.c in general AV dislikes many parts of orangefs_devreq_writev. Besides making orangefs_devreq_writev more easily readable and better commented, this patch makes an effort to address some of the problems: > The 5th is quietly ignored unless trailer_size is positive and > status is zero. If trailer_size > 0 && status == 0, you verify that > the length of the 5th segment is no more than trailer_size and copy > it to vmalloc'ed buffer. Without bothering to zero the rest of that > buffer out. It was just wrong to allow a 5th segment that is not exactly equal to trailer_size. Now that that's fixed, there's nothing to zero out in the vmalloced buffer - it is exactly the right size to hold the 5th segment. > Another API bogosity: when the 5th segment is present, successful writev() > returns the sum of sizes of the first 4. Added size of 5th segment to writev return... > if concatenation of the first 4 segments is longer than > 16 + sizeof(struct pvfs2_downcall_s) by no more than sizeof(long) => whine > and proceed with garbage. If 4th segment isn't exactly sizeof(struct pvfs2_downcall_s), whine and fail. > if the 32bit value 4 bytes into op->downcall is zero and 64bit > value following it is non-zero, the latter is interpreted as the size of > trailer data. The latter is what userspace claimed was the length of the trailer data. The kernel module now compares it to the trailer iovec's iov_len as a sanity check. > if there's no trailer, the 5th segment (if present) is completely ignored. Whine and fail if there should be no trailer, yet a 5th segment is present. > if vmalloc fails, act as if status (32bit at offset 5 into > op->downcall) had been -ENOMEM and don't look at the 5th segment at all. whine and fail with -ENOMEM. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-14 13:32:05 -05:00
Mike Marshall	b4cf67a2ba	Orangef: remove overlooked old-style userspace debug parts Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-11 11:00:12 -05:00
Mike Marshall	b5e376ea8b	Orangefs: improve comments Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-11 10:50:42 -05:00
Mike Marshall	7cec28e91d	Orangefs: don't keep checking stuff in on Friday afternoon. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-11 10:46:22 -05:00
Mike Marshall	e91f1c8043	Orangefs: don't expose internal details of pathname resolution to userspace. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-04 13:06:55 -05:00
Mike Marshall	575e946125	Orangefs: change pvfs2 filenames to orangefs Also changed references within source files that referred to header files whose names had changed. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-04 12:56:14 -05:00
Yi Liu	8bb8aefd5a	OrangeFS: Change almost all instances of the string PVFS2 to OrangeFS. OrangeFS was formerly known as PVFS2 and retains the name in many places. I leave the device /dev/pvfs2-req since this affects userspace. I leave the filesystem type pvfs2 since this affects userspace. Further the OrangeFS sysint library reads fstab for an entry of type pvfs2 independently of kernel mounts. I leave extended attribute keys user.pvfs2 and system.pvfs2 as the sysint library understands these. I leave references to userspace binaries still named pvfs2. I leave the filenames. Signed-off-by: Yi Liu <yi9@clemson.edu> [martin@omnibond.com: clairify above constraints and merge] Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-12-03 14:45:44 -05:00
Al Viro	555fa0fa61	fs: out of bounds on stack in iov_iter_advance On Wed, Nov 11, 2015 at 10:19:48AM +0000, Al Viro wrote: > I'll cook the minimal fixup for API change after I get some sleep and > send it your way, unless somebody gets there first... This should do it - switches ->ioctl() to pvfs2_inode_[gs]etxattr() and converts xattr_handler ->[gs]et() to new API. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-16 11:27:24 -05:00
Martin Brandenburg	24c8d0804b	Orangefs: Clean up pvfs2_devreq_read. * Kick invalid arguments out early, so handling them does not clutter the code. * Avoid possibility of race by not releasing lock until completely done. * Do not leak ops (memory) in certain error condition. * Check for more error conditions. * Put module name in all error and debug logs. * Document behavior. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 14:50:20 -05:00
Martin Brandenburg	f0ed4418d4	Orangefs: Remove upcall trailers which are not used. Also removes remnants of iox (readx/writex) which previously used trailers, but no longer exist. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 14:43:11 -05:00
Mike Marshall	b5bbc84328	Orangefs: fix gossip statement Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 14:39:15 -05:00
Martin Brandenburg	6d0dd7684c	Orangefs: Remove unused #defines from signal blocking code. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 13:36:51 -05:00
Mike Marshall	3f1b6947dc	Orangefs: set pos after generic_write_checks if we are appending, generic_write_checks would have updated pos to the end of the file... Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 13:05:11 -05:00
Al Viro	8092895f75	orangefs: validate the response in decode_dirents() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 12:02:12 -05:00
Al Viro	ade1d48b78	orangefs: don't leave uninitialized data in ->trailer_buf minimal fix; it would be better to reject such requests outright. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:57:34 -05:00
Al Viro	9be68b0871	orangefs: get rid of dec_string and enc_string The latter is never used, the former has one user and would be better off spelled out right there. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:56:17 -05:00
Al Viro	ef4af94edc	orangefs: switch decode_dirents() to use of kcalloc() gets rid of multiplication overflow Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:55:39 -05:00
Al Viro	5714156be2	orangefs: sanitize pvfs2_convert_time_field() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:54:05 -05:00
Al Viro	aada5c5872	orangefs: kill pointless ->link() and ->mknod() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:53:24 -05:00
Al Viro	75992b0fa9	pvfs2_fill_sb(): use kzalloc() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:52:52 -05:00
Al Viro	5c0dbbc64b	orangefs: kill struct pvfs2_mount_sb_info_s The only reason for that thing used to be the API of mount_nodev() callback; since we are calling pvfs2_fill_sb() ourselves now, we don't have to shove everything into a single structure. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:39:31 -05:00
Al Viro	b05a785109	orangefs: double iput() in case of d_make_root() failure Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:38:51 -05:00
Al Viro	16742f2d7c	orangefs: use get_user_pages_fast(), not get_user_pages() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:38:02 -05:00
Al Viro	a0435ca18e	orangefs: kill kmap/kunmap wrappers Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:37:19 -05:00
Al Viro	74f68fce2a	orangefs: make pvfs2_inode_read() take iov_iter ... and make the only caller use page-backed iov_iter, getting rid of kmap/kunmap and of the bug with attempted use of iovec-backed copy_page_to_iter() on a kernel pointer. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:36:31 -05:00
Al Viro	0071ed1ec6	orangefs: make do_readv_writev() take iov_iter no need to build a copy of what the caller already has; what's more, we want the one given to caller properly advanced and we shouldn't depend upon it being an iovec-backed one. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:35:28 -05:00
Al Viro	dc4067f671	orangefs: don't bother with splitting iovecs copy_page_{to,from}_iter() advances it just fine and it has no problem with partially consumed segments. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:23:02 -05:00
Al Viro	3c2fcfcb68	orangefs: make wait_for_direct_io() take iov_iter incidentally, insane or compromised server returning more than requested on read should not oops the kernel - initialize the iov_iter for read according to the iovec we've got. That's why pvfs_bufmap_copy_to_iovec() needed a separate size argument - we shouldn't abuse iov_iter_count(iter) for passing that. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:21:37 -05:00
Al Viro	a5c126a522	orangefs: make precopy_buffers() take iov_iter Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:18:50 -05:00
Al Viro	5f0e3c953f	orangefs: make postcopy_buffers() take iov_iter Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 11:11:55 -05:00
Al Viro	34204fde4c	pvfs_bufmap_copy_from_iovec(): don't rely upon size being equal to iov_iter_count(iter) Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 10:57:53 -05:00
Al Viro	5c278228bb	orangefs: explicitly pass the size to pvfs_bufmap_copy_to_iovec() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-11-13 10:25:01 -05:00
Mike Marshall	548049495c	Orangefs: fix some checkpatch.pl complaints that had creeped in. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-05 13:44:24 -04:00
Martin Brandenburg	894ac432b4	Orangefs: Clean up error decoding. Errors from the server need to be decoded. A bunch of code was imported from the server to do this but much of it is convoluted and not even needed. The result is better but still as convoluted as required by the protocol. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 12:04:46 -04:00
Martin Brandenburg	353908035f	Orangefs: Use readonly mmap since writepage is not implemented. Previously the code silently failed to update the disk. Now it will not allow writable and shared mmaps. Signed-off-by: Martin Brandenburg <martin@omnibond.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:39 -04:00
Mike Marshall	1be21f865a	Orangefs: don't use mount_nodev, use sget directly. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:37 -04:00
Mike Marshall	8c3905adea	Orangefs: update signal blocking code before Oleg sees it. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:36 -04:00
Mike Marshall	50e01586f4	Orangefs: Don't opencode memcpy. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:35 -04:00
Mike Marshall	f957ae2dec	Orangefs: choose return codes from among the expected ones. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:34 -04:00
Mike Marshall	d6fe654b7b	Orangefs: put PVFS_util_min out of its misery. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:33 -04:00
Mike Marshall	88309aae3d	Orangefs: fix dir_emit code in pvfs2_readdir. Al Viro glanced at readdir and surmised that getdents would misbehave the way it was written... and sure enough. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:32 -04:00
Mike Marshall	4d1c44043b	Orangefs: use iov_iter interface replace opencoded pvfs_bufmap_copy_to_kernel_iovec, pvfs_bufmap_copy_to_user_iovec, pvfs_bufmap_copy_iovec_from_kernel, and pvfs_bufmap_copy_iovec_from_user with pvfs_bufmap_copy_to_iovec and pvfs_bufmap_copy_from_iovec, which both use the iov_iter interface. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:44:00 -04:00
Guenter Roeck	81b784b11e	Orangefs: Swap order of include files spinlock_types.h requires types from linux/types.h. Including spinlock_types.h first may result in the following build errors, as seen with arm:allmodconfig. arch/arm/include/asm/spinlock_types.h:12:3: error: unknown type name 'u32' arch/arm/include/asm/spinlock_types.h:16:4: error: unknown type name 'u16' Fixes: deb4fb58ff73 ("Orangefs: kernel client part 2") Cc: Mark Brown <broonie@kernel.org> Cc: Mike Marshall <hubcap@omnibond.com> Signed-off-by: Guenter Roeck <linux@roeck-us.net> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:08 -04:00
Sasha Levin	cb987f3cbe	fs: orangefs: remove execute priviliges from module params This makes no sense and causes warnings on boot. Signed-off-by: Sasha Levin <sasha.levin@oracle.com> Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:06 -04:00
Mike Marshall	f0566532fe	Orangefs: use inode_set_bytes for directories Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:05 -04:00
Mike Marshall	c36316b74e	Orangefs: large integer implicitly truncated to unsigned type make.cross ARCH=tile doesn't like "inode->i_bytes = PAGE_CACHE_SIZE;", so cast PAGE_CACHE_SIZE to unsigned short. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:04 -04:00
Mike Marshall	eeaa3d448c	Orangefs: address problems found by static checker Don't check for negative rc from boolean. Don't pointlessly initialize variables, it short-circuits gcc's uninitialized variable warnings. And max_new_nr_segs can never be zero, so don't check for it. Preserve original kstrdup pointer for freeing later. Don't check for negative value in unsigned variable. Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:03 -04:00
Mike Marshall	84d02150de	Orangefs: sooth most sparse complaints Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:02 -04:00
Mike Marshall	2c590d5fb6	Orangefs: kernel client update 1. Stephen Rothwell noticed that orangefs would not compile on powerpc... Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:01 -04:00
Mike Marshall	07f01962e3	Orangefs: kernel client part 7 Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:40:00 -04:00
Mike Marshall	1182fca3bc	Orangefs: kernel client part 5 Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:39:57 -04:00
Mike Marshall	f7be4ee07f	Orangefs: kernel client part 4 Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:39:56 -04:00
Mike Marshall	274dcf55bd	Orangefs: kernel client part 3 Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:39:55 -04:00
Mike Marshall	5db11c21a9	Orangefs: kernel client part 2 Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:39:54 -04:00
Mike Marshall	f7ab093f74	Orangefs: kernel client part 1 OrangeFS (formerly PVFS) is an lgpl licensed userspace networked parallel file system. OrangeFS can be accessed through included system utilities, user integration libraries, MPI-IO and can be used by the Hadoop ecosystem as an alternative to the HDFS filesystem. OrangeFS is used widely for parallel science, data analytics and engineering applications. While applications often don't require Orangefs to be mounted into the VFS, users do like to be able to access their files in the normal way. The Orangefs kernel client allows Orangefs filesystems to be mounted as a VFS. The kernel client communicates with a userspace daemon which in turn communicates with the Orangefs server daemons that implement the filesystem. The server daemons (there's almost always more than one) need not be running on the same host as the kernel client. Orangefs filesystems can also be mounted with FUSE, and we ship code and instructions to facilitate that, but most of our users report preferring to use our kernel module instead. Further, as an example of a problem we can't solve with fuse, we have in the works a not-yet-ready-for-prime-time version of a file_operations lock function that accounts for the server daemons being distributed across more than one running kernel. Many people and organizations, including Clemson University, Argonne National Laboratories and Acxiom Corporation have helped to create what has become Orangefs over more than twenty years. Some of the more recent contributors to the kernel client include: Mike Marshall Christoph Hellwig Randy Martin Becky Ligon Walt Ligon Michael Moore Rob Ross Phil Carnes Signed-off-by: Mike Marshall <hubcap@omnibond.com>	2015-10-03 11:39:53 -04:00

444 Commits