op-kernel-dev - Development kernel branch for OpenPOWER systems

	Commit message (Collapse)	Author	Age	Files	Lines
*	netfilter: nf_conntrack: use SLAB_DESTROY_BY_RCU and get rid of call_rcu()	Eric Dumazet	2009-03-25	1	-2/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Use "hlist_nulls" infrastructure we added in 2.6.29 for RCUification of UDP & TCP. This permits an easy conversion from call_rcu() based hash lists to a SLAB_DESTROY_BY_RCU one. Avoiding call_rcu() delay at nf_conn freeing time has numerous gains. First, it doesnt fill RCU queues (up to 10000 elements per cpu). This reduces OOM possibility, if queued elements are not taken into account This reduces latency problems when RCU queue size hits hilimit and triggers emergency mode. - It allows fast reuse of just freed elements, permitting better use of CPU cache. - We delete rcu_head from "struct nf_conn", shrinking size of this structure by 8 or 16 bytes. This patch only takes care of "struct nf_conn". call_rcu() is still used for less critical conntrack parts, that may be converted later if necessary. Signed-off-by: Eric Dumazet <dada1@cosmosbay.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netns: ipmr: declare reg_vif_num per-namespace	Benjamin Thery	2009-01-22	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv4 multicast routing netns-aware. Declare variable 'reg_vif_num' per-namespace, move into struct netns_ipv4. At the moment, this variable is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ipmr: declare mroute_do_assert and mroute_do_pim per-namespace	Benjamin Thery	2009-01-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv4 multicast routing netns-aware. Declare IPv multicast routing variables 'mroute_do_assert' and 'mroute_do_pim' per-namespace in struct netns_ipv4. At the moment, these variables are only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ipmr: declare counter cache_resolve_queue_len per-namespace	Benjamin Thery	2009-01-22	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv4 multicast routing netns-aware. Declare variable cache_resolve_queue_len per-namespace: move it into struct netns_ipv4. This variable counts the number of unresolved cache entries queued in the list mfc_unres_queue. This list is kept global to all netns as the number of entries per namespace is limited to 10 (hardcoded in routine ipmr_cache_unresolved). Entries belonging to different namespaces in mfc_unres_queue will be identified by matching the mfc_net member introduced previously in struct mfc_cache. Keeping this list global to all netns, also allows us to keep a single timer (ipmr_expire_timer) to handle their expiration. In some places cache_resolve_queue_len value was tested for arming or deleting the timer. These tests were equivalent to testing mfc_unres_queue value instead and are replaced in this patch. At the moment, cache_resolve_queue_len is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ipmr: dynamically allocate mfc_cache_array	Benjamin Thery	2009-01-22	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv4 multicast routing netns-aware. Dynamically allocate IPv4 multicast forwarding cache, mfc_cache_array, and move it to struct netns_ipv4. At the moment, mfc_cache_array is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ipmr: dynamically allocate vif_table	Benjamin Thery	2009-01-22	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast routing netns-aware. Dynamically allocate interface table vif_table and move it to struct netns_ipv4, and update MIF_EXISTS() macro. At the moment, vif_table is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ipmr: allocate mroute_socket per-namespace.	Benjamin Thery	2009-01-22	1	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv4 multicast routing netns-aware. Make IPv4 multicast routing mroute_socket per-namespace, moves it into struct netns_ipv4. At the moment, mroute_socket is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: declare reg_vif_num per-namespace	Benjamin Thery	2008-12-10	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Declare variable 'reg_vif_num' per-namespace, moves into struct netns_ipv6. At the moment, this variable is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: declare mroute_do_assert and mroute_do_pim per-namespace	Benjamin Thery	2008-12-10	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Declare IPv6 multicast forwarding variables 'mroute_do_assert' and 'mroute_do_pim' per-namespace in struct netns_ipv6. At the moment, these variables are only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: declare counter cache_resolve_queue_len per-namespace	Benjamin Thery	2008-12-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Declare variable cache_resolve_queue_len per-namespace: moves it into struct netns_ipv6. This variable counts the number of unresolved cache entries queued in the list mfc_unres_queue. This list is kept global to all netns as the number of entries per namespace is limited to 10 (hardcoded in routine ip6mr_cache_unresolved). Entries belonging to different namespaces in mfc_unres_queue will be identified by matching the mfc_net member introduced previously in struct mfc6_cache. Keeping this list global to all netns, also allows us to keep a single timer (ipmr_expire_timer) to handle their expiration. In some places cache_resolve_queue_len value was tested for arming or deleting the timer. These tests were equivalent to testing mfc_unres_queue value instead and are replaced in this patch. At the moment, cache_resolve_queue_len is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: dynamically allocate mfc6_cache_array	Benjamin Thery	2008-12-10	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Dynamically allocates IPv6 multicast forwarding cache, mfc6_cache_array, and moves it to struct netns_ipv6. At the moment, mfc6_cache_array is only referenced in init_net. Replace 'ARRAY_SIZE(mfc6_cache_array)' with mfc6_cache_array size: MFC6_LINES. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: dynamically allocates vif6_table	Benjamin Thery	2008-12-10	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Dynamically allocates interface table vif6_table and moves it to struct netns_ipv6, and updates MIF_EXISTS() macro. At the moment, vif6_table is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netns: ip6mr: allocate mroute6_socket per-namespace.	Benjamin Thery	2008-12-10	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \|	Preliminary work to make IPv6 multicast forwarding netns-aware. Make IPv6 multicast forwarding mroute6_socket per-namespace, moves it into struct netns_ipv6. At the moment, mroute6_socket is only referenced in init_net. Signed-off-by: Benjamin Thery <benjamin.thery@bull.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	Merge branch 'master' of ↵	David S. Miller	2008-11-28	1	-0/+5
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \|	git://git.kernel.org/pub/scm/linux/kernel/git/kaber/nf-next-2.6 Conflicts: net/netfilter/nf_conntrack_netlink.c
\| *	netfilter: netns ebtables: ebtable_nat in netns	Alexey Dobriyan	2008-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
\| *	netfilter: netns ebtables: ebtable_filter in netns	Alexey Dobriyan	2008-11-04	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
\| *	netfilter: netns ebtables: ebtable_broute in netns	Alexey Dobriyan	2008-11-04	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
* \|	netns xfrm: per-netns sysctls	Alexey Dobriyan	2008-11-25	1	-0/+10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Make net.core.xfrm_aevent_etime net.core.xfrm_acq_expires net.core.xfrm_aevent_rseqth net.core.xfrm_larval_drop sysctls per-netns. For that make net_core_path[] global, register it to prevent two /proc/net/core antries and change initcall position -- xfrm_init() is called from fs_initcall, so this one should be fs_initcall at least. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns MIBs	Alexey Dobriyan	2008-11-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns NETLINK_XFRM socket	Alexey Dobriyan	2008-11-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Stub senders to init_net's one temporarily. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns policy hash resizing work	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns policy counts	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_policy_bydst hash	Alexey Dobriyan	2008-11-25	1	-0/+6
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns inexact policies	Alexey Dobriyan	2008-11-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_policy_byidx hashmask	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Per-netns hashes are independently resizeable. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_policy_byidx hash	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns policy list	Alexey Dobriyan	2008-11-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns km_waitq	Alexey Dobriyan	2008-11-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Disallow spurious wakeups in __xfrm_lookup(). Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns state GC work	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	State GC is per-netns, and this is part of it. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns state GC list	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	km_waitq is going to be made per-netns to disallow spurious wakeups in __xfrm_lookup(). To not wakeup after every garbage-collected xfrm_state (which potentially can be from different netns) make state GC list per-netns. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_hash_work	Alexey Dobriyan	2008-11-25	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	All of this is implicit passing which netns's hashes should be resized. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state counts	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state_hmask	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since hashtables are per-netns, they can be independently resized. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state_byspi hash	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state_bysrc hash	Alexey Dobriyan	2008-11-25	1	-0/+1
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state_bydst hash	Alexey Dobriyan	2008-11-25	1	-0/+9
\| \| \| \| \| \| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: per-netns xfrm_state_all list	Alexey Dobriyan	2008-11-25	1	-0/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is done to get a) simple "something leaked" check b) cover possible DoSes when other netns puts many, many xfrm_states onto a list. c) not miss "alien xfrm_state" check in some of list iterators in future. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
* \|	netns xfrm: add netns boilerplate	Alexey Dobriyan	2008-11-25	1	-0/+7
\|/ \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	net: implement emergency route cache rebulds when gc_elasticity is exceeded	Neil Horman	2008-10-27	1	-0/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a patch to provide on demand route cache rebuilding. Currently, our route cache is rebulid periodically regardless of need. This introduced unneeded periodic latency. This patch offers a better approach. Using code provided by Eric Dumazet, we compute the standard deviation of the average hash bucket chain length while running rt_check_expire. Should any given chain length grow to larger that average plus 4 standard deviations, we trigger an emergency hash table rebuild for that net namespace. This allows for the common case in which chains are well behaved and do not grow unevenly to not incur any latency at all, while those systems (which may be being maliciously attacked), only rebuild when the attack is detected. This patch take 2 other factors into account: 1) chains with multiple entries that differ by attributes that do not affect the hash value are only counted once, so as not to unduly bias system to rebuilding if features like QOS are heavily used 2) if rebuilding crosses a certain threshold (which is adjustable via the added sysctl in this patch), route caching is disabled entirely for that net namespace, since constant rebuilding is less efficient that no caching at all Tested successfully by me. Signed-off-by: Neil Horman <nhorman@tuxdriver.com> Signed-off-by: Eric Dumazet <dada1@cosmosbay.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netfilter: netns: use NFPROTO_NUMPROTO instead of NUMPROTO for tables array	Patrick McHardy	2008-10-20	1	-2/+2
\| \| \| \| \| \| \|	The netfilter families have been decoupled from regular protocol families. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	ipv6: making ip and icmp statistics per/namespace	Denis V. Lunev	2008-10-08	1	-0/+3
\| \| \| \| \|	Signed-off-by: Denis V. Lunev <den@openvz.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	netfilter: netns nat: per-netns bysource hash	Alexey Dobriyan	2008-10-08	1	-0/+2
\| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nat: per-netns NAT table	Alexey Dobriyan	2008-10-08	1	-0/+1
\| \| \| \| \| \| \|	Same story as with iptable_filter, iptables_raw tables. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns conntrack accounting	Alexey Dobriyan	2008-10-08	1	-0/+2
\| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns ↵	Alexey Dobriyan	2008-10-08	1	-0/+1
\| \| \| \| \| \| \|	net.netfilter.nf_conntrack_log_invalid sysctl Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns net.netfilter.nf_conntrack_checksum ↵	Alexey Dobriyan	2008-10-08	1	-0/+1
\| \| \| \| \| \| \|	sysctl Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns net.netfilter.nf_conntrack_count sysctl	Alexey Dobriyan	2008-10-08	1	-0/+4
\| \| \| \| \| \| \| \|	Note, sysctl table is always duplicated, this is simpler and less special-cased. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns statistics	Alexey Dobriyan	2008-10-08	1	-0/+1
\| \| \| \| \|	Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns event cache	Alexey Dobriyan	2008-10-08	1	-0/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Heh, last minute proof-reading of this patch made me think, that this is actually unneeded, simply because "ct" pointers will be different for different conntracks in different netns, just like they are different in one netns. Not so sure anymore. [Patrick: pointers will be different, flushing can only be done while inactive though and thus it needs to be per netns] Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>
*	netfilter: netns nf_conntrack: per-netns unconfirmed list	Alexey Dobriyan	2008-10-08	1	-0/+2
\| \| \| \| \| \| \| \|	What is confirmed connection in one netns can very well be unconfirmed in another one. Signed-off-by: Alexey Dobriyan <adobriyan@gmail.com> Signed-off-by: Patrick McHardy <kaber@trash.net>