op-kernel-dev - Development kernel branch for OpenPOWER systems

	Commit message (Collapse)	Author	Age	Files	Lines
*	NFS: Throttle page dirtying while we're flushing to disk	Trond Myklebust	2009-03-11	1	-11/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The following patch is a combination of a patch by myself and Peter Staubach. Trond: If we allow other processes to dirty pages while a process is doing a consistency sync to disk, we can end up never making progress. Peter: Attached is a patch which addresses a continuing problem with the NFS client generating out of order WRITE requests. While this is compliant with all of the current protocol specifications, there are servers in the market which can not handle out of order WRITE requests very well. Also, this may lead to sub-optimal block allocations in the underlying file system on the server. This may cause the read throughputs to be reduced when reading the file from the server. Peter: There has been a lot of work recently done to address out of order issues on a systemic level. However, the NFS client is still susceptible to the problem. Out of order WRITE requests can occur when pdflush is in the middle of writing out pages while the process dirtying the pages calls generic_file_buffered_write which calls generic_perform_write which calls balance_dirty_pages_rate_limited which ends up calling writeback_inodes which ends up calling back into the NFS client to write out dirty pages for the same file that pdflush happens to be working with. Signed-off-by: Peter Staubach <staubach@redhat.com> [modification by Trond to merge the two similar patches] Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Merge branch 'task_killable' of git://git.kernel.org/pub/scm/linux/kernel/git/willy/misc	Linus Torvalds	2008-02-01	1	-14/+5
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* 'task_killable' of git://git.kernel.org/pub/scm/linux/kernel/git/willy/misc: (22 commits) Remove commented-out code copied from NFS NFS: Switch from intr mount option to TASK_KILLABLE Add wait_for_completion_killable Add wait_event_killable Add schedule_timeout_killable Use mutex_lock_killable in vfs_readdir Add mutex_lock_killable Use lock_page_killable Add lock_page_killable Add fatal_signal_pending Add TASK_WAKEKILL exit: Use task_is_* signal: Use task_is_* sched: Use task_contributes_to_load, TASK_ALL and TASK_NORMAL ptrace: Use task_is_* power: Use task_is_* wait: Use TASK_NORMAL proc/base.c: Use task_is_* proc/array.c: Use TASK_REPORT perfmon: Use task_is_* ... Fixed up conflicts in NFS/sunrpc manually..
\| *	NFS: Switch from intr mount option to TASK_KILLABLE	Matthew Wilcox	2007-12-06	1	-13/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	By using the TASK_KILLABLE infrastructure, we can get rid of the 'intr' mount option. We have to use _killable everywhere instead of _interruptible as we get rid of rpc_clnt_sigmask/sigunmask. Signed-off-by: Liam R. Howlett <howlett@gmail.com> Signed-off-by: Matthew Wilcox <willy@linux.intel.com>
* \|	NFS: Clean up the write request locking.	Trond Myklebust	2008-01-30	1	-5/+8
\|/ \| \| \| \| \|	Ensure that we set/clear NFS_PAGE_TAG_LOCKED when the nfs_page is hashed. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	mm: Remove slab destructors from kmem_cache_create().	Paul Mundt	2007-07-20	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Slab destructors were no longer supported after Christoph's c59def9f222d44bb7e2f0a559f2906191a0862d7 change. They've been BUGs for both slab and slub, and slob never supported them either. This rips out support for the dtor pointer from kmem_cache_create() completely and fixes up every single callsite in the kernel (there were about 224, not including the slab allocator definitions themselves, or the documentation references). Signed-off-by: Paul Mundt <lethal@linux-sh.org>
*	NFS: Replace NFS_I(inode)->req_lock with inode->i_lock	Trond Myklebust	2007-07-10	1	-5/+6
\| \| \| \| \| \| \|	There is no justification for keeping a special spinlock for the exclusive use of the NFS writeback code. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: reduce latency by using conditional rescheduling in nfs_scan_list	Trond Myklebust	2007-07-10	1	-1/+2
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Prevent integer overflow in nfs_scan_list()	Trond Myklebust	2007-07-10	1	-0/+2
\| \| \| \| \| \| \|	Also ensure that nfs_inode ncommit and npages are large enough to represent all possible values for the number of pages. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS cleanup: speed up nfs_scan_commit using radix tree tags	Trond Myklebust	2007-07-10	1	-7/+7
\| \| \| \| \| \|	Add a tag for requests that are waiting for a COMMIT Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS cleanup: Rename NFS_PAGE_TAG_WRITEBACK to NFS_PAGE_TAG_LOCKED	Trond Myklebust	2007-07-10	1	-7/+7
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Convert struct nfs_page to use krefs	Trond Myklebust	2007-07-10	1	-6/+8
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Replace vfsmount and dentry in nfs_open_context with struct path	Trond Myklebust	2007-07-10	1	-3/+3
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Avoid a deadlock situation on write	Trond Myklebust	2007-05-24	1	-0/+20
\| \| \| \| \| \| \| \| \| \| \| \| \|	When processes are allowed to attempt to lock a non-contiguous range of nfs write requests, it is possible for generic_writepages to 'wrap round' the address space, and call writepage() on a request that is already locked by the same process. We avoid the deadlock by checking if the page index is contiguous with the list of nfs write requests that is already held in our nfs_pageio_descriptor prior to attempting to lock a new request. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Detach sched.h from mm.h	Alexey Dobriyan	2007-05-21	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	First thing mm.h does is including sched.h solely for can_do_mlock() inline function which has "current" dereference inside. By dealing with can_do_mlock() mm.h can be detached from sched.h which is good. See below, why. This patch a) removes unconditional inclusion of sched.h from mm.h b) makes can_do_mlock() normal function in mm/mlock.c c) exports can_do_mlock() to not break compilation d) adds sched.h inclusions back to files that were getting it indirectly. e) adds less bloated headers to some files (asm/signal.h, jiffies.h) that were getting them indirectly Net result is: a) mm.h users would get less code to open, read, preprocess, parse, ... if they don't need sched.h b) sched.h stops being dependency for significant number of files: on x86_64 allmodconfig touching sched.h results in recompile of 4083 files, after patch it's only 3744 (-8.3%). Cross-compile tested on all arm defconfigs, all mips defconfigs, all powerpc defconfigs, alpha alpha-up arm i386 i386-up i386-defconfig i386-allnoconfig ia64 ia64-up m68k mips parisc parisc-up powerpc powerpc-up s390 s390-up sparc sparc-up sparc64 sparc64-up um-x86_64 x86_64 x86_64-up x86_64-defconfig x86_64-allnoconfig as well as my two usual configs. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
*	NFS: Kill the obsolete NFS_PARANOIA	Jesper Juhl	2007-05-09	1	-7/+0
\| \| \| \| \| \| \|	Signed-off-by: Jesper Juhl <jesper.juhl@gmail.com> Acked-by: Trond Myklebust <Trond.Myklebust@netapp.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Fix a compile glitch on 64-bit systems	Trond Myklebust	2007-05-04	1	-1/+1
\| \| \| \| \| \| \| \| \|	fs/nfs/pagelist.c:226: error: conflicting types for 'nfs_pageio_init' include/linux/nfs_page.h:80: error: previous declaration of 'nfs_pageio_init' was here Thanks to Andrew for spotting this... Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Clean up nfs_create_request comments	Jason Uhlenkott	2007-05-02	1	-7/+1
\| \| \| \| \| \| \|	Remove some stale comments about hard limits which went away in 2.5. Signed-off-by: Jason Uhlenkott <juhlenko@akamai.com> Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Use pgoff_t in structures and functions that pass page cache offsets	Trond Myklebust	2007-04-30	1	-2/+2
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Fix a buffer overflow in the allocation of struct nfs_read/writedata	Trond Myklebust	2007-04-30	1	-2/+8
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Fix a race when doing NFS write coalescing	Trond Myklebust	2007-04-30	1	-92/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently we do write coalescing in a very inefficient manner: one pass in generic_writepages() in order to lock the pages for writing, then one pass in nfs_flush_mapping() and/or nfs_sync_mapping_wait() in order to gather the locked pages for coalescing into RPC requests of size "wsize". In fact, it turns out there is actually a deadlock possible here since we only start I/O on the second pass. If the user signals the process while we're in nfs_sync_mapping_wait(), for instance, then we may exit before starting I/O on all the requests that have been queued up. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Cleanup for nfs_readpages()	Trond Myklebust	2007-04-30	1	-2/+2
\| \| \| \| \| \| \|	Do the coalescing of read requests into block sized requests at start of I/O as we scan through the pages instead of going through a second pass. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Another cleanup of the read/write request coalescing code	Trond Myklebust	2007-04-30	1	-5/+66
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Cleanup the coalescing code	Trond Myklebust	2007-04-30	1	-32/+85
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Merge branch 'master' of /home/trondmy/kernel/linux-2.6/ into merge_linus	Trond Myklebust	2006-12-07	1	-2/+2
\|\
\| *	[PATCH] slab: remove kmem_cache_t	Christoph Lameter	2006-12-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Replace all uses of kmem_cache_t with struct kmem_cache. The patch was generated using the following script: #!/bin/sh # # Replace one string by another in all the kernel sources. # set -e for file in `find * -name "*.c" -o -name "*.h"\|xargs grep -l $1`; do quilt add $file sed -e "1,\$s/$1/$2/g" $file >/tmp/$$ mv /tmp/$$ $file quilt refresh done The script was run like this sh replace kmem_cache_t "struct kmem_cache" Signed-off-by: Christoph Lameter <clameter@sgi.com> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
\| *	[PATCH] slab: remove SLAB_KERNEL	Christoph Lameter	2006-12-07	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SLAB_KERNEL is an alias of GFP_KERNEL. Signed-off-by: Christoph Lameter <clameter@sgi.com> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
* \|	NFS: Fix up writeback_control->nr_to_write accounting	Trond Myklebust	2006-12-06	1	-9/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We're really accounting for the same page twice now: once in generic_writepages(), and once in nfs_scan_dirty(). Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
* \|	NFS: Fix up the dirty page accounting	Trond Myklebust	2006-12-06	1	-1/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There is now no reason to account for the dirty pages in the NFS code, since the VM code will now do it for us via __set_page_dirty_nobuffers(), and set_page_writeback(). We still need to keep the accounting of stable writes, though. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
* \|	NFS: Clean up nfs_scan_dirty()	Trond Myklebust	2006-12-06	1	-25/+51
\|/ \| \| \| \| \|	Pass down struct writeback control. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] Really ignore kmem_cache_destroy return value	Alexey Dobriyan	2006-09-27	1	-2/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* Roughly half of callers already do it by not checking return value * Code in drivers/acpi/osl.c does the following to be sure: (void)kmem_cache_destroy(cache); * Those who check it printk something, however, slab_error already printed the name of failed cache. * XFS BUGs on failed kmem_cache_destroy which is not a decision a low-level filesystem driver should make. Converted to ignore. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	Merge git://git.kernel.org/pub/scm/linux/kernel/git/bunk/trivial	Linus Torvalds	2006-06-30	1	-1/+0
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	* git://git.kernel.org/pub/scm/linux/kernel/git/bunk/trivial: Remove obsolete #include <linux/config.h> remove obsolete swsusp_encrypt arch/arm26/Kconfig typos Documentation/IPMI typos Kconfig: Typos in net/sched/Kconfig v9fs: do not include linux/version.h Documentation/DocBook/mtdnand.tmpl: typo fixes typo fixes: specfic -> specific typo fixes in Documentation/networking/pktgen.txt typo fixes: occuring -> occurring typo fixes: infomation -> information typo fixes: disadvantadge -> disadvantage typo fixes: aquire -> acquire typo fixes: mecanism -> mechanism typo fixes: bandwith -> bandwidth fix a typo in the RTC_CLASS help text smb is no longer maintained Manually merged trivial conflict in arch/um/kernel/vmlinux.lds.S
\| *	Remove obsolete #include <linux/config.h>	Jörn Engel	2006-06-30	1	-1/+0
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Jörn Engel <joern@wohnheim.fh-wedel.de> Signed-off-by: Adrian Bunk <bunk@stusta.de>
* \|	[PATCH] zoned vm counters: conversion of nr_dirty to per zone counter	Christoph Lameter	2006-06-30	1	-0/+1
\|/ \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This makes nr_dirty a per zone counter. Looping over all processors is avoided during writeback state determination. The counter aggregation for nr_dirty had to be undone in the NFS layer since we summed up the page counts from multiple zones. Someone more familiar with NFS should probably review what I have done. [akpm@osdl.org: bugfix] Signed-off-by: Christoph Lameter <clameter@sgi.com> Cc: Trond Myklebust <trond.myklebust@fys.uio.no> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	[PATCH] fix static linking of NFS	David Brownell	2006-06-27	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Builds on ARM report link problems with common configurations like statically linked NFS (for nfsroot). The symptom is that __init section code references __exit section code; that won't work since the exit sections are discarded (since they can never be called). The best fix for these particular cases would be an "__init_or_exit" section annotation. Signed-off-by: David Brownell <dbrownell@users.sourceforge.net> Acked-by: Trond Myklebust <trond.myklebust@fys.uio.no> Signed-off-by: Linus Torvalds <torvalds@osdl.org>
*	NFS: Split fs/nfs/inode.c	David Howells	2006-06-09	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	As fs/nfs/inode.c is rather large, heterogeneous and unwieldy, the attached patch splits it up into a number of files: (*) fs/nfs/inode.c Strictly inode specific functions. (*) fs/nfs/super.c Superblock management functions for NFS and NFS4, normal access, clones and referrals. The NFS4 superblock functions _could_ move out into a separate conditionally compiled file, but it's probably not worth it as there're so many common bits. (*) fs/nfs/namespace.c Some namespace-specific functions have been moved here. (*) fs/nfs/nfs4namespace.c NFS4-specific namespace functions (this could be merged into the previous file). This file is conditionally compiled. (*) fs/nfs/internal.h Inter-file declarations, plus a few simple utility functions moved from fs/nfs/inode.c. Additionally, all the in-.c-file externs have been moved here, and those files they were moved from now include this file. For the most part, the functions have not been changed, only some multiplexor functions have changed significantly. I've also: (*) Added some extra banner comments above some functions. (*) Rearranged the function order within the files to be more logical and better grouped (IMO), though someone may prefer a different order. (*) Reduced the number of #ifdefs in .c files. (*) Added missing __init and __exit directives. Signed-Off-By: David Howells <dhowells@redhat.com>
*	NFS: Flesh out nfs_invalidate_page()	Trond Myklebust	2006-06-09	1	-19/+28
\| \| \| \| \| \| \|	In the case of a call to truncate_inode_pages(), we should really try to cancel any pending writes on the page. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Fix a race with PG_private and nfs_release_page()	Trond Myklebust	2006-03-20	1	-5/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	We don't need to set PG_private for readahead pages, since they never get unlocked while I/O is in progress. However there is a small race in nfs_readpage_release() whereby the page may be unlocked, and have PG_private set. Fix is to have PG_private set only for the case of writes... Also fix a bug in nfs_clear_page_writeback(): Don't attempt to clear the radix_tree tag if we've already deleted the radix tree entry. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	NFS: Avoid races between writebacks and truncation	Trond Myklebust	2006-03-20	1	-2/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, there is no serialisation between NFS asynchronous writebacks and truncation at the page level due to the fact that nfs_sync_inode() cannot lock the pages that it is about to write out. This means that it is possible to be flushing out data (and calling something like set_page_writeback()) while the page cache is busy evicting the page. Oops... Use the hooks provided in try_to_release_page() to ensure that dirty pages are always written back to storage before we evict them. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFS: Replace nfs_page insertion sort with a radix sort	Trond Myklebust	2005-06-22	1	-30/+56
\| \| \| \|	Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFS: Make searching and waiting on busy writeback requests more efficient.	Trond Myklebust	2005-06-22	1	-1/+28
\| \| \| \| \| \| \| \| \|	Basically copies the VFS's method for tracking writebacks and applies it to the struct nfs_page. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	[PATCH] NFS: cleanup: shrink struct nfs_open_context	Trond Myklebust	2005-06-22	1	-7/+28
\| \| \| \| \| \| \|	Remove the wait queue, and replace the functions that depended on it with wait_on_bit(). Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
*	Linux-2.6.12-rc2 (tag: v2.6.12-rc2)	Linus Torvalds	2005-04-16	1	-0/+309
	Initial git repository build. I'm not bothering with the full history, even though we have it. We can create a separate "historical" git archive of that later if we want to, and in the meantime it's about 3.2GB when imported into git - space that would just make the early git days unnecessarily complicated, when we don't have a lot of good infrastructure for it. Let it rip!