op-kernel-dev - Development kernel branch for OpenPOWER systems

	Commit message (Collapse)	Author	Age	Files	Lines
*	nfsd4: allow large readdirs	J. Bruce Fields	2014-05-30	1	-3/+2
\| \| \| \| \| \| \| \|	Currently we limit readdir results to a single page. This can result in a performance regression compared to NFSv3 when reading large directories. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: more precise nfsd4_max_reply	J. Bruce Fields	2014-05-30	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \|	It will turn out to be useful to have a more accurate estimate of reply size; so, piggyback on the existing op reply-size estimators. Also move nfsd4_max_reply to nfs4proc.c to get easier access to struct nfsd4_operation and friends. (Thanks to Christoph Hellwig for pointing out that simplification.) Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: convert 4.1 replay encoding	J. Bruce Fields	2014-05-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Limits on maxresp_sz mean that we only ever need to replay rpc's that are contained entirely in the head. The one exception is very small zero-copy reads. That's an odd corner case as clients wouldn't normally ask those to be cached. in any case, this seems a little more robust. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: teach encoders to handle reserve_space failures	J. Bruce Fields	2014-05-30	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	We've tried to prevent running out of space with COMPOUND_SLACK_SPACE and special checking in those operations (getattr) whose result can vary enormously. However: - COMPOUND_SLACK_SPACE may be difficult to maintain as we add more protocol. - BUG_ON or page faulting on failure seems overly fragile. - Especially in the 4.1 case, we prefer not to fail compounds just because the returned result came close to session limits. (Though perfect enforcement here may be difficult.) - I'd prefer encoding to be uniform for all encoders instead of having special exceptions for encoders containing, for example, attributes. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: fix encoding of out-of-space replies	J. Bruce Fields	2014-05-27	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \|	If nfsd4_check_resp_size() returns an error then we should really be truncating the reply here, otherwise we may leave extra garbage at the end of the rpc reply. Also add a warning to catch any cases where our reply-size estimates may be wrong in the case of a non-idempotent operation. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: tweak nfsd4_encode_getattr to take xdr_stream	J. Bruce Fields	2014-05-23	1	-3/+4
\| \| \| \| \| \| \| \|	Just change the nfsd4_encode_getattr api. Not changing any code or adding any new functionality yet. Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: embed xdr_stream in nfsd4_compoundres	J. Bruce Fields	2014-05-23	1	-3/+1
\| \| \| \| \| \| \|	This is a mechanical transformation with no change in behavior. Reviewed-by: Christoph Hellwig <hch@lst.de> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: nfsd4_replay_cache_entry should be static	J. Bruce Fields	2014-03-28	1	-2/+0
\| \| \| \| \| \|	This isn't actually used anywhere else. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	Define op_iattr for nfsd4_open instead using macro	Kinglong Mee	2014-01-06	1	-2/+1
\| \| \| \| \|	Signed-off-by: Kinglong Mee <kinglongmee@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd: using nfsd4_encode_noop for encoding destroy_session/free_stateid	Kinglong Mee	2014-01-03	1	-1/+0
\| \| \| \| \| \| \| \|	Get rid of the extra code, using nfsd4_encode_noop for encoding destroy_session and free_stateid. And, delete unused argument (fr_status) int nfsd4_free_stateid. Signed-off-by: Kinglong Mee <kinglongmee@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	NFSD: Server implementation of MAC Labeling	David Quigley	2013-05-15	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \|	Implement labeled NFS on the server: encoding and decoding, and writing and reading, of file labels. Enabled with CONFIG_NFSD_V4_SECURITY_LABEL. Signed-off-by: Matthew N. Dodd <Matthew.Dodd@sparta.com> Signed-off-by: Miguel Rodel Felipe <Rodel_FM@dsi.a-star.edu.sg> Signed-off-by: Phua Eu Gene <PHUA_Eu_Gene@dsi.a-star.edu.sg> Signed-off-by: Khin Mi Mi Aung <Mi_Mi_AUNG@dsi.a-star.edu.sg> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: cleanup handling of nfsv4.0 closed stateid's	J. Bruce Fields	2013-04-08	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Closed stateid's are kept around a little while to handle close replays in the 4.0 case. So we stash them in the last-used stateid in the oo_last_closed_stateid field of the open owner. We can free that in encode_seqid_op_tail once the seqid on the open owner is next incremented. But we don't want to do that on the close itself; so we set NFS4_OO_PURGE_CLOSE flag set on the open owner, skip freeing it the first time through encode_seqid_op_tail, then when we see that flag set next time we free it. This is unnecessarily baroque. Instead, just move the logic that increments the seqid out of the xdr code and into the operation code itself. The justification given for the current placement is that we need to wait till the last minute to be sure we know whether the status is a sequence-id-mutating error or not, but examination of the code shows that can't actually happen. Reported-by: Yanchuan Nian <ycnian@gmail.com> Tested-by: Yanchuan Nian <ycnian@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd: remove unused macro in nfsv4	Yanchuan Nian	2013-04-03	1	-1/+0
\| \| \| \| \| \| \| \|	lk_rflags is never used anywhere, and rflags is not defined in struct nfsd4_lock. Signed-off-by: Yanchuan Nian <ycnian@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: handle seqid-mutating open errors from xdr decoding	J. Bruce Fields	2013-04-03	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	If a client sets an owner (or group_owner or acl) attribute on open for create, and the mapping of that owner to an id fails, then we return BAD_OWNER. But BAD_OWNER is a seqid-mutating error, so we can't shortcut the open processing that case: we have to at least look up the owner so we can find the seqid to bump. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: simplify nfsd4_encode_fattr interface slightly	J. Bruce Fields	2013-01-23	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	It seems slightly simpler to make nfsd4_encode_fattr rather than its callers responsible for advancing the write pointer on success. (Also: the count == 0 check in the verify case looks superfluous. Running out of buffer space is really the only reason fattr encoding should fail with eresource.) Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: disable zero-copy on non-final read ops	J. Bruce Fields	2012-12-17	1	-0/+8
\| \| \| \| \| \| \|	To ensure ordering of read data with any following operations, turn off zero copy if the read is not the final operation in the compound. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: delay filling in write iovec array till after xdr decoding	J. Bruce Fields	2012-11-26	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Our server rejects compounds containing more than one write operation. It's unclear whether this is really permitted by the spec; with 4.0, it's possibly OK, with 4.1 (which has clearer limits on compound parameters), it's probably not OK. No client that we're aware of has ever done this, but in theory it could be useful. The source of the limitation: we need an array of iovecs to pass to the write operation. In the worst case that array of iovecs could have hundreds of elements (the maximum rwsize divided by the page size), so it's too big to put on the stack, or in each compound op. So we instead keep a single such array in the compound argument. We fill in that array at the time we decode the xdr operation. But we decode every op in the compound before executing any of them. So once we've used that array we can't decode another write. If we instead delay filling in that array till the time we actually perform the write, we can reuse it. Another option might be to switch to decoding compound ops one at a time. I considered doing that, but it has a number of other side effects, and I'd rather fix just this one problem for now. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: move more write parameters into xdr argument	J. Bruce Fields	2012-11-26	1	-0/+2
\| \| \| \| \| \|	In preparation for moving some of this elsewhere. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd: use service net instead of hard-coded init_net	Stanislav Kinsbursky	2012-11-15	1	-1/+1
\| \| \| \| \| \| \| \|	This patch replaces init_net by SVC_NET(), where possible and also passes proper context to nested functions where required. Signed-off-by: Stanislav Kinsbursky <skinsbursky@parallels.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: implement backchannel_ctl operation	J. Bruce Fields	2012-11-07	1	-0/+2
\| \| \| \| \| \|	This operation is mandatory for servers to implement. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: int/__be32 fixes	J. Bruce Fields	2012-05-31	1	-3/+3
\| \| \| \| \| \| \|	In each of these cases there's a simple unambiguous correct choice, and no actual bug. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	NFSD: Fix nfs4_verifier memory alignment	Chuck Lever	2012-03-20	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Clean up due to code review. The nfs4_verifier's data field is not guaranteed to be u32-aligned. Casting an array of chars to a u32 * is considered generally hazardous. We can fix most of this by using a __be32 array to generate the verifier's contents and then byte-copying it into the verifier field. However, there is one spot where there is a backwards compatibility constraint: the do_nfsd_create() call expects a verifier which is 32-bit aligned. Fix this spot by forcing the alignment of the create verifier in the nfsd4_open args structure. Also, sizeof(nfs4_verifer) is the size of the in-core verifier data structure, but NFS4_VERIFIER_SIZE is the number of octets in an XDR'd verifier. The two are not interchangeable, even if they happen to have the same value. Signed-off-by: Chuck Lever <chuck.lever@oracle.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: implement NFS4_SHARE_WANT_NO_DELEG, NFS4_OPEN_DELEGATE_NONE_EXT, ↵	Benny Halevy	2012-02-17	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	why_no_deleg Respect client request for not getting a delegation in NFSv4.1 Appropriately return delegation "type" NFS4_OPEN_DELEGATE_NONE_EXT and WND4_NOT_WANTED reason. [nfsd41: add missing break when encoding op_why_no_deleg] Signed-off-by: Benny Halevy <bhalevy@tonian.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	NFSD: Clean up the test_stateid function	Bryan Schumaker	2012-02-17	1	-2/+7
\| \| \| \| \| \| \| \| \| \| \|	When I initially wrote it, I didn't understand how lists worked so I wrote something that didn't use them. I think making a list of stateids to test is a more straightforward implementation, especially compared to especially compared to decoding stateids while simultaneously encoding a reply to the client. Signed-off-by: Bryan Schumaker <bjschuma@netapp.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: split out share_access want and signal flags while decoding	Benny Halevy	2012-02-17	1	-2/+4
\| \| \| \| \|	Signed-off-by: Benny Halevy <bhalevy@tonian.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: use current stateid by value	Tigran Mkrtchyan	2012-02-15	1	-2/+11
\| \| \| \| \|	Signed-off-by: Tigran Mkrtchyan <kofemann@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: save and restore current stateid with current fh	Tigran Mkrtchyan	2012-02-15	1	-0/+1
\| \| \| \| \|	Signed-off-by: Tigran Mkrtchyan <kofemann@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: handle current stateid in open and close	Tigran Mkrtchyan	2012-02-15	1	-0/+1
\| \| \| \| \|	Signed-off-by: Tigran Mkrtchyan <kofemann@gmail.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: rearrange struct nfsd4_slot	J. Bruce Fields	2012-02-14	1	-1/+2
\| \| \| \| \| \| \| \| \| \| \| \| \|	Combine two booleans into a single flag field, move the smaller fields to the end. (In practice this doesn't make the struct any smaller. But we'll be adding another flag here soon.) Remove some debugging code that doesn't look useful, while we're in the neighborhood. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfs41: implement DESTROY_CLIENTID operation	Mi Jinlong	2011-10-24	1	-0/+5
\| \| \| \| \| \| \|	According to rfc5661 18.50, implement DESTROY_CLIENTID operation. Signed-off-by: Mi Jinlong <mijinlong@cn.fujitsu.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: warn on open failure after create	J. Bruce Fields	2011-10-17	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	If we create the object and then return failure to the client, we're left with an unexpected file in the filesystem. I'm trying to eliminate such cases but not 100% sure I have so an assertion might be helpful for now. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: preallocate open stateid in process_open1()	J. Bruce Fields	2011-10-17	1	-0/+1
\| \| \| \| \| \| \|	As with the nfs4_file, we'd prefer to find out about any failure before creating a new file rather than after. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: preallocate nfs4_file in process_open1()	J. Bruce Fields	2011-10-17	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Creating a new file is an irrevocable step--once it's visible in the filesystem, other processes may have seen it and done something with it, and unlinking it wouldn't simply undo the effects of the create. Therefore, in the case where OPEN creates a new file, we shouldn't do the create until we know that the rest of the OPEN processing will succeed. For example, we should preallocate a struct file in case we need it until waiting to allocate it till process_open2(), which is already too late. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: clean up open owners on OPEN failure	J. Bruce Fields	2011-10-17	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If process_open1() creates a new open owner, but the open later fails, the current code will leave the open owner around. It won't be on the close_lru list, and the client isn't expected to send a CLOSE, so it will hang around as long as the client does. Similarly, if process_open1() removes an existing open owner from the close lru, anticipating that an open owner that previously had no associated stateid's now will, but the open subsequently fails, then we'll again be left with the same leak. Fix both problems. Reported-by: Bryan Schumaker <bjschuma@netapp.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: move name-length checks to xdr	J. Bruce Fields	2011-10-11	1	-2/+1
\| \| \| \| \| \|	Again, these checks are better in the xdr code. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: assume test_stateid always has session	J. Bruce Fields	2011-09-26	1	-1/+0
\| \| \| \| \| \| \| \|	Test_stateid is 4.1-only and only allowed after a sequence operation, so this check is unnecessary. Cc: Bryan Schumaker <bjschuma@netapp.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd41: try to check reply size before operation	Mi Jinlong	2011-09-16	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	For checking the size of reply before calling a operation, we need try to get maxsize of the operation's reply. v3: using new method as Bruce said, "we could handle operations in two different ways: - For operations that actually change something (write, rename, open, close, ...), do it the way we're doing it now: be very careful to estimate the size of the response before even processing the operation. - For operations that don't change anything (read, getattr, ...) just go ahead and do the operation. If you realize after the fact that the response is too large, then return the error at that point. So we'd add another flag to op_flags: say, OP_MODIFIES_SOMETHING. And for operations with OP_MODIFIES_SOMETHING set, we'd do the first thing. For operations without it set, we'd do the second." Signed-off-by: Mi Jinlong <mijinlong@cn.fujitsu.com> [bfields@redhat.com: crash, don't attempt to handle, undefined op_rsize_bop] Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: split stateowners into open and lockowners	J. Bruce Fields	2011-09-07	1	-1/+1
\| \| \| \| \| \| \| \|	The stateowner has some fields that only make sense for openowners, and some that only make sense for lockowners, and I find it a lot clearer if those are separated out. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: eliminate unused lt_stateowner	J. Bruce Fields	2011-09-01	1	-1/+0
\| \| \| \| \| \|	This is used only as a local variable. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: drop most stateowner refcounting	J. Bruce Fields	2011-09-01	1	-1/+1
\| \| \| \| \| \| \|	Maybe we'll bring it back some day, but we don't have much real use for it now. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: cleanup seqid op stateowner usage	J. Bruce Fields	2011-08-31	1	-7/+0
\| \| \| \| \| \| \| \|	Now that the replay owner is in the cstate we can remove it from a lot of other individual operations and further simplify nfs4_preprocess_seqid_op(). Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: remove HAS_SESSION	J. Bruce Fields	2011-08-27	1	-1/+1
\| \| \| \| \| \|	This flag doesn't really buy us anything. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd: turn on reply cache for NFSv4	J. Bruce Fields	2011-07-18	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \|	It's sort of ridiculous that we've never had a working reply cache for NFSv4. On the other hand, we may still not: our current reply cache is likely not very good, especially in the TCP case (which is the only case that matters for v4). What we really need here is some serious testing. Anyway, here's a start. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: call nfsd4_release_compoundargs from pc_release	J. Bruce Fields	2011-07-18	1	-1/+1
\| \| \| \| \| \|	This simplifies cleanup a bit. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	NFSD: Added TEST_STATEID operation	Bryan Schumaker	2011-07-15	1	-0/+17
\| \| \| \| \| \| \| \|	This operation is used by the client to check the validity of a list of stateids. Signed-off-by: Bryan Schumaker <bjschuma@netapp.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	NFSD: added FREE_STATEID operation	Bryan Schumaker	2011-07-15	1	-0/+8
\| \| \| \| \| \| \| \|	This operation is used by the client to tell the server to free a stateid. Signed-off-by: Bryan Schumaker <bjschuma@netapp.com> Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: set sequence flag when backchannel is down	J. Bruce Fields	2011-01-11	1	-1/+1
\| \| \| \| \| \|	Implement the SEQ4_STATUS_CB_PATH_DOWN flag. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: support BIND_CONN_TO_SESSION	J. Bruce Fields	2011-01-11	1	-0/+2
\| \| \| \| \| \| \|	Basic xdr and processing for BIND_CONN_TO_SESSION. This adds a connection to the list of connections associated with a session. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd4: implement secinfo_no_name	J. Bruce Fields	2010-12-17	1	-0/+5
\| \| \| \| \| \|	Implementation of this operation is mandatory for NFSv4.1. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
*	nfsd: Fix possible BUG_ON firing in set_change_info	Neil Brown	2010-12-08	1	-11/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	If vfs_getattr in fill_post_wcc returns an error, we don't set fh_post_change. For NFSv4, this can result in set_change_info triggering a BUG_ON. i.e. fh_post_saved being zero isn't really a bug. So: - instead of BUGging when fh_post_saved is zero, just clear ->atomic. - if vfs_getattr fails in fill_post_wcc, take a copy of i_ctime anyway. This will be used i seg_change_info, but not overly trusted. - While we are there, remove the pointless 'if' statements in set_change_info. There is no harm setting all the values. Signed-off-by: NeilBrown <neilb@suse.de> Cc: stable@kernel.org Signed-off-by: J. Bruce Fields <bfields@redhat.com>