op-kernel-dev - Development kernel branch for OpenPOWER systems

	Commit message (Collapse)	Author	Age	Files	Lines
*	NFS: Fake up 'wcc' attributes to prevent cache invalidation after write	Trond Myklebust	2007-10-09	1	-1/+2
\| \| \| \| \| \| \| \| \|	NFSv2 and v4 don't offer weak cache consistency attributes on WRITE calls. In NFSv3, returning wcc data is optional. In all cases, we want to prevent the client from invalidating our cached data whenever ->write_done() attempts to update the inode attributes. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Make NFSv4 ACCESS calls return attributes too...	Trond Myklebust	2007-10-09	1	-0/+3
\| \| \| \| \| \| \|	It doesn't really make sense to cache an access call without also revalidating the attributes. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	SUNRPC: Clean up the sillyrename code	Trond Myklebust	2007-07-19	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Fix a couple of bugs: - Don't rely on the parent dentry still being valid when the call completes. Fixes a race with shrink_dcache_for_umount_subtree() - Don't remove the file if the filehandle has been labelled as stale. Fix a couple of inefficiencies - Remove the global list of sillyrenamed files. Instead we can cache the sillyrename information in the dentry->d_fsdata - Move common code from unlink_setup/unlink_done into fs/nfs/unlink.c Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Introduce struct nfs_removeargs+nfs_removeres	Trond Myklebust	2007-07-19	1	-12/+15
\| \| \| \| \| \| \|	We need a common structure for setting up an unlink() rpc call in order to fix the asynchronous unlink code. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Reduce the chances of an open_owner identifier collision	Trond Myklebust	2007-07-10	1	-2/+2
\| \| \| \| \| \|	Currently we just use a 32-bit counter. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS4: on a O_EXCL OPEN make sure SETATTR sets the fields holding the verifier	Jeff Layton	2007-07-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The Linux NFS4 client simply skips over the bitmask in an O_EXCL open call and so it doesn't bother to reset any fields that may be holding the verifier. This patch has us save the first two words of the bitmask (which is all the current client has #defines for). The client then later checks this bitmask and turns on the appropriate flags in the sattr->ia_verify field for the following SETATTR call. This patch only currently checks to see if the server used the atime and mtime slots for the verifier (which is what the Linux server uses for this). I'm not sure of what other fields the server could reasonably use, but adding checks for others should be trivial. Signed-off-by: Jeff Layton <jlayton@redhat.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Merge branch 'master' of /home/trondmy/kernel/linux-2.6/	Trond Myklebust	2007-02-12	1	-2/+2
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Conflicts: net/sunrpc/auth_gss/gss_krb5_crypto.c net/sunrpc/auth_gss/gss_spkm3_token.c net/sunrpc/clnt.c Merge with mainline and fix conflicts.
\| *	[PATCH] mark struct inode_operations const 3	Arjan van de Ven	2007-02-12	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Many struct inode_operations in the kernel can be "const". Marking them const moves these to the .rodata section, which avoids false sharing with potential dirty data. In addition it'll catch accidental writes at compile time to these shared resources. Signed-off-by: Arjan van de Ven <arjan@linux.intel.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
* \|	NFS: Remove nfs_readpage_sync()	Trond Myklebust	2007-02-03	1	-1/+0
\|/ \| \| \| \| \|	It makes no sense to maintain 2 parallel systems for reading in pages. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Remove nfs_writepage_sync()	Trond Myklebust	2006-12-06	1	-2/+0
\| \| \| \| \| \| \| \|	Maintaining two parallel ways of doing synchronous writes is rather pointless. This patch gets rid of the legacy nfs_writepage_sync(), and replaces it with the faster asynchronous writes. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] nfs: verifier is network-endian	Al Viro	2006-10-20	1	-4/+4
\| \| \| \| \| \| \| \|	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Acked-by: Trond Myklebust <trond.myklebust@fys.uio.no> Acked-by: Neil Brown <neilb@suse.de> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	[PATCH] xdr annotations: NFS readdir entries	Al Viro	2006-10-20	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	on-the-wire data is big-endian [in large part pulled from Alexey's patch] Signed-off-by: Al Viro <viro@zeniv.linux.org.uk> Acked-by: Trond Myklebust <trond.myklebust@fys.uio.no> Acked-by: Neil Brown <neilb@suse.de> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	NFS: Use cached page as buffer for NFS symlink requests	Chuck Lever	2006-09-22	1	-7/+10
\| \| \| \| \| \| \| \| \| \| \|	Now that we have a copy of the symlink path in the page cache, we can pass a struct page down to the XDR routines instead of a string buffer. Test plan: Connectathon, all NFS versions. Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Fix double d_drop in nfs_instantiate() error path	Chuck Lever	2006-09-22	1	-3/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If the LOOKUP or GETATTR in nfs_instantiate fail, nfs_instantiate will do a d_drop before returning. But some callers already do a d_drop in the case of an error return. Make certain we do only one d_drop in all error paths. This issue was introduced because over time, the symlink proc API diverged slightly from the create/mkdir/mknod proc API. To prevent other coding mistakes of this type, change the symlink proc API to be more like create/mkdir/mknod and move the nfs_instantiate call into the symlink proc routines so it is used in exactly the same way for create, mkdir, mknod, and symlink. Test plan: Connectathon, all versions of NFS. Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	SUNRPC: remove extraneous header inclusions	Chuck Lever	2006-09-22	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \|	include/linux/sunrpc/clnt.h already includes include/linux/sunrpc/xprt.h. We can remove xprt.h from source files that already include clnt.h. Likewise include/linux/sunrpc/timer.h. Test plan: Compile kernel with CONFIG_NFS enabled. Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Add extra const qualifiers	David Howells	2006-09-22	1	-3/+3
\| \| \| \| \| \| \|	Add some extra const qualifiers into NFS. Signed-Off-By: David Howells <dhowells@redhat.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Add a server capabilities NFS RPC op	David Howells	2006-09-22	1	-0/+1
\| \| \| \| \| \| \|	Add a set_capabilities NFS RPC op so that the server capabilities can be set. Signed-Off-By: David Howells <dhowells@redhat.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Add a lookupfh NFS RPC op	David Howells	2006-09-22	1	-0/+3
\| \| \| \| \| \| \| \| \|	Add a lookup filehandle NFS RPC op so that a file handle can be looked up without requiring dentries and inodes and other VFS stuff when doing an NFS4 pathwalk during mounting. Signed-Off-By: David Howells <dhowells@redhat.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFS: large non-page-aligned direct I/O clobbers memory	Trond Myklebust	2006-09-08	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The logic in nfs_direct_read_schedule and nfs_direct_write_schedule can allow data->npages to be one larger than rpages. This causes a page pointer to be written beyond the end of the pagevec in nfs_read_data (or nfs_write_data). Fix this by making nfs_(read\|write)_alloc() calculate the size of the pagevec array, and initialise data->npages. Also get rid of the redundant argument to nfs_commit_alloc(). Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com> Cc: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	NFSv4: increase client-provided nfs4 clientid size	J. Bruce Fields	2006-08-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Neil Brown observed that the current limit of 32 bytes isn't enough to hold two ip addresses and the rest of the stuff we're putting in it, so it's often truncated to the point where it's unlikely to be unique. This can cause spurious CLID_INUSE's from the server. Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com> (cherry picked from fc8c17ec251e984ab3df9182ed097aa5b577c915 commit)
*	Revert "Merge branch 'odirect'"	Trond Myklebust	2006-06-28	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	This reverts ccf01ef7aa9c6c293a1c64c27331a2ce227916ec commit. No idea how git managed this one: when I asked it to merge the odirect topic branch it actually generated a patch which reverted the change. Reverting the 'merge' will once again reveal Chuck's recent NFS/O_DIRECT work to the world. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Merge branch 'odirect'	Trond Myklebust	2006-06-25	1	-2/+0
\|
*	NFS: Eliminate nfs_get_user_pages()	Chuck Lever	2006-06-24	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Neil Brown observed that the kmalloc() in nfs_get_user_pages() is more likely to fail if the I/O is large enough to require the allocation of more than a single page to keep track of all the pinned pages in the user's buffer. Instead of tracking one large page array per dreq/iocb, track pages per nfs_read/write_data, just like the cached I/O path does. An array for pages is already allocated for us by nfs_readdata_alloc() (and the write and commit equivalents). This is also required for adding support for vectored I/O to the NFS direct I/O path. The original reason to pin the user buffer and allocate all the NFS data structures before trying to schedule I/O was to ensure all needed resources are allocated on the client before starting to send requests. This reduces the chance that resource exhaustion on the client will cause a short read or write. On the other hand, for an application making very large application I/O requests, this means that it will be nearly impossible for the application to make forward progress on a resource-limited client. Thus, moving the buffer pinning functionality into the I/O scheduling loops should be good for scalability. The next patch will do the same for NFS data structure allocation. Signed-off-by: Chuck Lever <cel@netapp.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Follow a referral	Manoj Naik	2006-06-09	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	Respond to a moved error on NFS lookup by setting up the referral. Note: We don't actually follow the referral during lookup/getattr, but later when we detect fsid mismatch in inode revalidation (similar to the processing done for cloning submounts). Referrals will have fake attributes until they are actually followed or traversed. Signed-off-by: Manoj Naik <manoj@almaden.ibm.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: convert fs-locations-components to conform to RFC3530	Manoj Naik	2006-06-09	1	-10/+20
\| \| \| \| \| \| \| \|	Use component4-style formats for decoding list of servers and pathnames in fs_locations. Signed-off-by: Manoj Naik <manoj@almaden.ibm.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Implement the fs_locations function call	Trond Myklebust	2006-06-09	1	-0/+24
\| \| \| \| \| \| \| \| \| \| \|	NFSv4 allows for the fact that filesystems may be replicated across several servers or that they may be migrated to a backup server in case of failure of the primary server. fs_locations is an NFSv4 operation for retrieving information about the location of migrated and/or replicated filesystems. Based on an initial implementation by Jiaying Zhang <jiayingz@citi.umich.edu> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Store the file system "fsid" value in the NFS super block.	Trond Myklebust	2006-06-09	1	-7/+12
\| \| \| \| \| \| \|	This should enable us to detect if we are crossing a mountpoint in the case where the server is exporting "nohide" mounts. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Optimize allocation of nfs_read/write_data structures	Chuck Lever	2006-06-09	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Clean up use of page_array, and fix an off-by-one error noticed by Tom Talpey which causes kmalloc calls in cases where using the page_array is sufficient. Test plan: Normal client functional testing with r/wsize=32768. Signed-off-by: Chuck Lever <cel@netapp.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Clean up inode metadata updates	Trond Myklebust	2006-06-09	1	-2/+1
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Cleanup of NFS read code	Trond Myklebust	2006-03-20	1	-1/+1
\| \| \| \| \| \| \| \|	Same callback hierarchy inversion as for the NFS write calls. This patch is not strictly speaking needed by the O_DIRECT code, but avoids confusing differences between the asynchronous read and write code. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Cleanup of NFS write code in preparation for asynchronous o_direct	Trond Myklebust	2006-03-20	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch inverts the callback hierarchy for NFS write calls. Instead of having the NFSv2/v3/v4-specific code set up the RPC callback ops, we allow the original caller to do so. This allows for more flexibility w.r.t. how to set up and tear down the nfs_write_data structure while still allowing the NFSv3/v4 code to perform error handling. The greater flexibility is needed by the asynchronous O_DIRECT code, which wants to be able to hold on to the original nfs_write_data structures after the WRITE RPC call has completed in order to be able to replay them if the COMMIT call determines that the server has rebooted. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Ensure DELEGRETURN returns attributes	Trond Myklebust	2006-01-06	1	-0/+6
\| \| \| \| \| \| \| \|	Upon return of a write delegation, the server will almost always bump the change attribute. Ensure that we pick up that change so that we don't invalidate our data cache unnecessarily. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: support large reads and writes on the wire	Chuck Lever	2006-01-06	1	-13/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Most NFS server implementations allow up to 64KB reads and writes on the wire. The Solaris NFS server allows up to a megabyte, for instance. Now the Linux NFS client supports transfer sizes up to 1MB, too. This will help reduce protocol and context switch overhead on read/write intensive NFS workloads, and support larger atomic read and write operations on servers that support them. Test-plan: Connectathon and iozone on mount point with wsize=rsize>32768 over TCP. Tests with NFS over UDP to verify the maximum RPC payload size cap. Signed-off-by: Chuck Lever <cel@netapp.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: locking XDR cleanup	Trond Myklebust	2006-01-06	1	-28/+24
\| \| \| \| \| \|	Get rid of some unnecessary intermediate structures Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Make open_confirm() asynchronous too	Trond Myklebust	2006-01-06	1	-1/+1
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add post-op attributes to NFSv4 write and commit callbacks.	Trond Myklebust	2005-10-27	1	-0/+2
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add post-op attributes to nfs4_proc_remove()	Trond Myklebust	2005-10-27	1	-0/+7
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add post-op attributes to nfs4_proc_rename()	Trond Myklebust	2005-10-27	1	-0/+4
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add post-op attributes to nfs4_proc_link()	Trond Myklebust	2005-10-27	1	-0/+9
\| \| \| \| \| \| \|	Optimise attribute revalidation when hardlinking. Add post-op attributes for the directory and the original inode. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Add optional post-op getattr instruction to the NFSv4 file close.	Trond Myklebust	2005-10-27	1	-0/+3
\| \| \| \| \| \| \| \|	"Optional" means that the close call will not fail if the getattr at the end of the compound fails. If it does succeed, try to refresh inode attributes. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add directory post-op attributes to the CREATE operations.	Trond Myklebust	2005-10-27	1	-0/+2
\| \| \| \| \| \| \| \|	Since the directory attributes change every time we CREATE a file, we might as well pick up the new directory attributes in the same compound. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Optimise inode attribute cache updates	Trond Myklebust	2005-10-27	1	-1/+1
\| \| \| \| \| \| \|	Allow nfs_refresh_inode() also to update attributes on the inode if the RPC call was sent after the last call to nfs_update_inode(). Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Eliminate nfsv4 open race...	Trond Myklebust	2005-10-18	1	-1/+1
\| \| \| \| \| \| \|	Make NFSv4 return the fully initialized file pointer with the stateid that it created in the lookup w/intent. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Fix up handling of open_to_lock sequence ids	Trond Myklebust	2005-10-18	1	-15/+4
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Make NFS clean up byte range locks asynchronously	Trond Myklebust	2005-10-18	1	-1/+1
\| \| \| \| \| \|	Currently we fail to do so if the process was signalled. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Fix a potential CLOSE race	Trond Myklebust	2005-10-18	1	-1/+1
\| \| \| \| \| \| \| \| \| \|	Once the state_owner and lock_owner semaphores get removed, it will be possible for other OPEN requests to reopen the same file if they have lower sequence ids than our CLOSE call. This patch ensures that we recheck the file state once nfs_wait_on_sequence() has completed waiting. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFSv4: Add functions to order RPC calls	Trond Myklebust	2005-10-18	1	-7/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	NFSv4 file state-changing functions such as OPEN, CLOSE, LOCK,... are all labelled with "sequence identifiers" in order to prevent the server from reordering RPC requests, as this could cause its file state to become out of sync with the client. Currently the NFS client code enforces this ordering locally using semaphores to restrict access to structures until the RPC call is done. This, of course, only works with synchronous RPC calls, since the user process must first grab the semaphore. By dropping semaphores, and instead teaching the RPC engine to hold the RPC calls until they are ready to be sent, we can extend this process to work nicely with asynchronous RPC calls too. This patch adds a new list called "rpc_sequence" that defines the order of the RPC calls to be sent. We add one such list for each state_owner. When an RPC call is ready to be sent, it checks if it is top of the rpc_sequence list. If so, it proceeds. If not, it goes back to sleep, and loops until it hits top of the list. Once the RPC call has completed, it can then bump the sequence id counter, and remove itself from the rpc_sequence list, and then wake up the next sleeper. Note that the state_owner sequence ids and lock_owner sequence ids are all indexed to the same rpc_sequence list, so OPEN, LOCK,... requests are all ordered w.r.t. each other. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFS: Add support for NFSv3 ACLs	Andreas Gruenbacher	2005-06-22	1	-0/+27
\| \| \| \| \| \| \| \| \| \| \| \| \|	This adds acl support fo nfs clients via the NFSACL protocol extension, by implementing the getxattr, listxattr, setxattr, and removexattr iops for the system.posix_acl_access and system.posix_acl_default attributes. This patch implements a dumb version that uses no caching (and thus adds some overhead). (Another patch in this patchset adds caching as well.) Signed-off-by: Andreas Gruenbacher <agruen@suse.de> Acked-by: Olaf Kirch <okir@suse.de> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFSv4: Client-side xdr for writing NFSv4 acls	J. Bruce Fields	2005-06-22	1	-0/+7
\| \| \| \| \| \| \| \|	Client-side support for NFSv4 acls: xdr encoding and decoding routines for writing acls Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFSv4: Client-side xdr for reading NFSv4 acls	J. Bruce Fields	2005-06-22	1	-0/+7
\| \| \| \| \| \| \| \|	Client-side support for NFSv4 acls: xdr encoding and decoding routines for reading acls Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>