op-kernel-dev - Development kernel branch for OpenPOWER systems

	Commit message (Collapse)	Author	Age	Files	Lines
*	[TCP]: tcp probe wraparound handling and other changes	Stephen Hemminger	2007-07-11	1	-70/+124
\| \| \| \| \| \| \| \| \| \| \| \| \|	Switch from formatting messages in probe routine and copying with kfifo, to using a small circular queue of information and formatting on read. This avoids wraparound issues with kfifo, and saves one copy. Also make sure to state correct license, rather than copying off some other driver I started with. Signed-off-by: Stephen Hemminger <shemminger@linux-foundation.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: Make all initialized struct seq_operations const.	Philippe De Muyter	2007-07-10	6	-8/+8
\| \| \| \| \| \| \|	Make all initialized struct seq_operations in net/ const Signed-off-by: Philippe De Muyter <phdm@macqel.be> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[UDP]: Fix length check.	Patrick McHardy	2007-07-10	1	-7/+2
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Rémi Denis-Courmont wrote: > Right. By the way, shouldn't "len" rather be signed in there? > > unsigned int len; > > /* if we're overly short, let UDP handle it */ > len = skb->len - sizeof(struct udphdr); > if (len <= 0) > goto udp; It should, but the < 0 case can't happen since __udp4_lib_rcv already makes sure that we have at least a complete UDP header. Anyways, this patch fixes it. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: Avoid copying writable clones in tunnel drivers	Patrick McHardy	2007-07-10	2	-2/+4
\| \| \| \| \|	Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPV4]: Make ip_tos2prio const.	Philippe De Muyter	2007-07-10	1	-1/+1
\| \| \| \| \|	Signed-off-by: Philippe De Muyter <phdm@macqel.be> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER] net/ipv4/netfilter/ip_tables.c: lower printk severity	Dan Aloni	2007-07-10	1	-1/+1
\| \| \| \| \| \|	Signed-off-by: Dan Aloni <da-x@monatomic.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: Convert DEBUGP to pr_debug	Patrick McHardy	2007-07-10	22	-330/+189
\| \| \| \| \| \| \|	Convert DEBUGP to pr_debug and fix lots of non-compiling debug statements. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: ipt_CLUSTERIP: add compat code	Patrick McHardy	2007-07-10	1	-19/+20
\| \| \| \| \| \| \| \| \|	Adjust structure size and don't expect pointers passed in from userspace to be valid. Also replace an enum in an ABI structure by a fixed size type. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: ipt_SAME: add to feature-removal-schedule	Patrick McHardy	2007-07-10	1	-1/+1
\| \| \| \| \|	Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack_expect: convert proc functions to hash	Patrick McHardy	2007-07-10	1	-23/+60
\| \| \| \| \| \| \|	Convert from the global expectation list to the hash table. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack: reduce masks to a subset of tuples	Patrick McHardy	2007-07-10	1	-6/+0
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Since conntrack currently allows to use masks for every bit of both helper and expectation tuples, we can't hash them and have to keep them on two global lists that are searched for every new connection. This patch removes the never used ability to use masks for the destination part of the expectation tuple and completely removes masks from helpers since the only reasonable choice is a full match on l3num, protonum and src.u.all. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack_expect: function naming unification	Patrick McHardy	2007-07-10	8	-28/+28
\| \| \| \| \| \| \| \| \| \|	Currently there is a wild mix of nf_conntrack_expect_, nf_ct_exp_, expect_, exp_, ... Consistently use nf_ct_ as prefix for exported functions. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: use hlists for bysource hash	Patrick McHardy	2007-07-10	1	-10/+11
\| \| \| \| \|	Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack: remove 'ignore_conntrack' argument from ↵	Patrick McHardy	2007-07-10	2	-3/+3
\| \| \| \| \| \| \| \| \|	nf_conntrack_find_get All callers pass NULL, this also doesn't seem very useful for modules. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack: use hlists for conntrack hash	Patrick McHardy	2007-07-10	1	-8/+9
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Convert conntrack hash to hlists to reduce its size and cache footprint. Since the default hashsize to max. entries ratio sucks (1:16), this patch doesn't reduce the amount of memory used for the hash by default, but instead uses a better ratio of 1:8, which results in the same max. entries value. One thing worth noting is early_drop. It really should use LRU, so it now has to iterate over the entire chain to find the last unconfirmed entry. Since chains shouldn't be very long and the entire operation is very rare this shouldn't be a problem. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack_extend: use __read_mostly for struct nf_ct_ext_type	Patrick McHardy	2007-07-10	1	-1/+1
\| \| \| \| \| \| \|	Also make them static. Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: merge nf_conn and nf_nat_info	Yasuyuki Kozakai	2007-07-10	2	-19/+17
\| \| \| \| \| \|	Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: kill global 'destroy' operation	Yasuyuki Kozakai	2007-07-10	1	-24/+22
\| \| \| \| \| \| \| \| \| \|	This kills the global 'destroy' operation which was used by NAT. Instead it uses the extension infrastructure so that multiple extensions can register own operations. Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack: remove old memory allocator of conntrack	Yasuyuki Kozakai	2007-07-10	1	-6/+0
\| \| \| \| \| \| \| \| \|	Now memory space for help and NAT are allocated by extension infrastructure. Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: remove unused nf_nat_module_is_loaded	Yasuyuki Kozakai	2007-07-10	2	-5/+0
\| \| \| \| \| \|	Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: use extension infrastructure	Yasuyuki Kozakai	2007-07-10	3	-19/+65
\| \| \| \| \| \|	Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat: add reference to conntrack from entry of bysource list	Yasuyuki Kozakai	2007-07-10	1	-1/+3
\| \| \| \| \| \| \| \| \|	I will split 'struct nf_nat_info' out from conntrack. So I cannot use 'offsetof' to get the pointer to conntrack from it. Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_conntrack: use extension infrastructure for helper	Yasuyuki Kozakai	2007-07-10	1	-10/+0
\| \| \| \| \| \|	Signed-off-by: Yasuyuki Kozakai <yasuyuki.kozakai@toshiba.co.jp> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: mark matches and targets __read_mostly	Patrick McHardy	2007-07-10	23	-27/+27
\| \| \| \| \|	Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: add TRACE target	Jozsef Kadlecsik	2007-07-10	2	-13/+118
\| \| \| \| \| \| \| \| \|	The TRACE target can be used to follow IP and IPv6 packets through the ruleset. Signed-off-by: Jozsef Kadlecsik <kadlec@blackhole.kfki.hu> Signed-off-by: Patrick NcHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: nf_nat_sip: only perform RTP DNAT if SIP session was SNATed	Jerome Borsboom	2007-07-10	1	-1/+5
\| \| \| \| \| \| \| \| \|	DNAT of the the RTP session is only necessary if the SIP session has been SNATed. Signed-off-by: Jerome Borsboom <j.borsboom@erasmusmc.nl> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: Remove redundant parentheses/braces	Jan Engelhardt	2007-07-10	7	-34/+21
\| \| \| \| \| \| \| \| \|	Removes redundant parentheses and braces (And add one pair in a xt_tcpudp.c macro). Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: Remove incorrect inline markers	Jan Engelhardt	2007-07-10	2	-2/+2
\| \| \| \| \| \| \| \| \|	device_cmp: the function's address is taken (call to nf_ct_iterate_cleanup) alloc_null_binding: referenced externally Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: add some consts, remove some casts	Jan Engelhardt	2007-07-10	10	-28/+39
\| \| \| \| \| \| \| \|	Make a number of variables const and/or remove unneeded casts. Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: switch xt_target->checkentry to bool	Jan Engelhardt	2007-07-10	13	-97/+97
\| \| \| \| \| \| \| \|	Switch the return type of target checkentry functions to boolean. Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: switch xt_match->checkentry to bool	Jan Engelhardt	2007-07-10	5	-25/+25
\| \| \| \| \| \| \| \|	Switch the return type of match functions to boolean Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: switch xt_match->match to bool	Jan Engelhardt	2007-07-10	9	-65/+65
\| \| \| \| \| \| \| \|	Switch the return type of match functions to boolean Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NETFILTER]: x_tables: switch hotdrop to bool	Jan Engelhardt	2007-07-10	10	-17/+17
\| \| \| \| \| \| \| \|	Switch the "hotdrop" variables to boolean Signed-off-by: Jan Engelhardt <jengelh@gmx.de> Signed-off-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[UDP]: Cleanup UDP encapsulation code	James Chapman	2007-07-10	2	-140/+130
\| \| \| \| \| \| \| \| \| \| \| \| \|	This cleanup fell out after adding L2TP support where a new encap_rcv funcptr was added to struct udp_sock. Have XFRM use the new encap_rcv funcptr, which allows us to move the XFRM encap code from udp.c into xfrm4_input.c. Make xfrm4_rcv_encap() static since it is no longer called externally. Signed-off-by: James Chapman <jchapman@katalix.com> Acked-by: Patrick McHardy <kaber@trash.net> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: SACK fastpath did override adjusted fackets_out	Ilpo Järvinen	2007-07-10	1	-0/+8
\| \| \| \| \| \| \| \|	Do same adjustment to SACK fastpath counters provided that they're valid. Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[UDP]: Introduce UDP encapsulation type for L2TP	James Chapman	2007-07-10	1	-4/+23
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This patch adds a new UDP_ENCAP_L2TPINUDP encapsulation type for UDP sockets. When a UDP socket's encap_type is UDP_ENCAP_L2TPINUDP, the skb is delivered to a function pointed to by the udp_sock's encap_rcv funcptr. If the skb isn't wanted by L2TP, it returns >0, which causes it to be passed through to UDP. Include padding to put the new encap_rcv field on a 4-byte boundary. Previously, the only user of UDP encap sockets was ESP, so when CONFIG_XFRM was not defined, some of the encap code was compiled out. This patch changes that. As a result, udp_encap_rcv() will now do a little more work when CONFIG_XFRM is not defined. Signed-off-by: James Chapman <jchapman@katalix.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NET]: IPV6 checksum offloading in network devices	Stephen Hemminger	2007-07-10	3	-3/+6
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The existing model for checksum offload does not correctly handle devices that can offload IPV4 and IPV6 only. The NETIF_F_HW_CSUM flag implies device can do any arbitrary protocol. This patch: * adds NETIF_F_IPV6_CSUM for those devices * fixes bnx2 and tg3 devices that need it * add NETIF_F_IPV6_CSUM to ipv6 output (incl GSO) * fixes assumptions about NETIF_F_ALL_CSUM in nat * adjusts bridge union of checksumming computation Signed-off-by: David S. Miller <davem@davemloft.net>
*	[XFRM]: Add module alias for transformation type.	Masahide NAKAMURA	2007-07-10	4	-0/+4
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	It is clean-up for XFRM type modules and adds aliases with its protocol: ESP, AH, IPCOMP, IPIP and IPv6 for IPsec ROUTING and DSTOPTS for MIPv6 It is almost the same thing as XFRM mode alias, but it is added new defines XFRM_PROTO_XXX for preprocessing since some protocols are defined as enum. Signed-off-by: Masahide NAKAMURA <nakam@linux-ipv6.org> Acked-by: Ingo Oeser <netdev@axxeo.de> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCPv4]: Improve BH latency in /proc/net/tcp	Herbert Xu	2007-07-10	1	-14/+5
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently the code for /proc/net/tcp disable BH while iterating over the entire established hash table. Even though we call cond_resched_softirq for each entry, we still won't process softirq's as regularly as we would otherwise do which results in poor performance when the system is loaded near capacity. This anomaly comes from the 2.4 code where this was all in a single function and the local_bh_disable might have made sense as a small optimisation. The cost of each local_bh_disable is so small when compared against the increased latency in keeping it disabled over a large but mostly empty TCP established hash table that we should just move it to the individual read_lock/read_unlock calls as we do in inet_diag. Signed-off-by: Herbert Xu <herbert@gondor.apana.org.au> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPV4]: The scheduled removal of multipath cached routing support.	David S. Miller	2007-07-10	10	-1157/+11
\| \| \| \| \| \|	With help from Chris Wedgwood. Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP] tcp_read_sock: Allow recv_actor() return return negative error value.	Jens Axboe	2007-06-23	1	-2/+6
\| \| \| \| \| \| \| \| \| \|	tcp_read_sock() currently assumes that the recv_actor() only returns number of bytes copied. For network splice receive, we may have to return an error in some cases. So allow the actor to return a negative error value. Signed-off-by: Jens Axboe <jens.axboe@oracle.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[IPVS]: Fix state variable on failure to start ipvs threads	Neil Horman	2007-06-18	1	-2/+39
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	ip_vs currently fails to reset its ip_vs_sync_state variable if the sync thread fails to start properly. The result is that the kernel will report a running daemon when their actuall is none. If you issue the following commands: 1. ipvsadm --start-daemon master --mcast-interface bla 2. ipvsadm -L --daemon 3. ipvsadm --stop-daemon master Assuming that bla is not an actual interface, step 2 should return no data, but instead returns: $ ipvsadm -L --daemon master sync daemon (mcast=bla, syncid=0) Signed-off-by: Neil Horman <nhorman@tuxdriver.com> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Fix logic breakage due to DSACK separation	Ilpo Järvinen	2007-06-15	1	-4/+5
\| \| \| \| \| \| \| \| \| \|	Commit 6f74651ae626ec672028587bc700538076dfbefb is found guilty of breaking DSACK counting, which should be done only for the SACK block reported by the DSACK instead of every SACK block that is received along with DSACK information. Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Congestion control API RTT sampling fix	Ilpo Järvinen	2007-06-15	5	-2/+16
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	Commit 164891aadf1721fca4dce473bb0e0998181537c6 broke RTT sampling of congestion control modules. Inaccurate timestamps could be fed to them without providing any way for them to identify such cases. Previously RTT sampler was called only if FLAG_RETRANS_DATA_ACKED was not set filtering inaccurate timestamps nicely. In addition, the new behavior could give an invalid timestamp (zero) to RTT sampler if only skbs with TCPCB_RETRANS were ACKed. This solves both problems. Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Add missing break to TCP option parsing code	Ilpo Järvinen	2007-06-14	1	-0/+1
\| \| \| \| \| \| \|	This flaw does not affect any behavior (currently). Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Set initial_ssthresh default to zero in Cubic and BIC.	David S. Miller	2007-06-13	2	-2/+2
\| \| \| \| \| \| \| \| \| \|	Because of the current default of 100, Cubic and BIC perform very poorly compared to standard Reno. In the worst case, this change makes Cubic and BIC as aggressive as Reno. So this change should be very safe. Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Fix left_out setting during FRTO	Ilpo Järvinen	2007-06-12	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \|	Without FRTO, the tcp_try_to_open is never called with lost_out > 0 (see tcp_time_to_recover). However, when FRTO is enabled, the !tp->lost condition is not used until end of FRTO because that way TCP avoids premature entry to fast recovery during FRTO. Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[TCP]: Disable TSO if MD5SIG is enabled.	David S. Miller	2007-06-12	1	-1/+2
\| \| \| \|	Signed-off-by: David S. Miller <davem@davemloft.net>
*	[CIPSO]: Fix several unaligned kernel accesses in the CIPSO engine.	Paul Moore	2007-06-08	1	-10/+13
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	IPv4 options are not very well aligned within the packet and the format of a CIPSO option is even worse. The result is that the CIPSO engine in the kernel does a few unaligned accesses when parsing and validating incoming packets with CIPSO options attached which generate error messages on certain alignment sensitive platforms. This patch fixes this by marking these unaligned accesses with the get_unaliagned() macro. Signed-off-by: Paul Moore <paul.moore@hp.com> Acked-by: James Morris <jmorris@namei.org> Signed-off-by: David S. Miller <davem@davemloft.net>
*	[NetLabel]: consolidate the struct socket/sock handling to just struct sock	Paul Moore	2007-06-08	1	-33/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The current NetLabel code has some redundant APIs which allow both "struct socket" and "struct sock" types to be used; this may have made sense at some point but it is wasteful now. Remove the functions that operate on sockets and convert the callers. Not only does this make the code smaller and more consistent but it pushes the locking burden up to the caller which can be more intelligent about the locks. Also, perform the same conversion (socket to sock) on the SELinux/NetLabel glue code where it make sense. Signed-off-by: Paul Moore <paul.moore@hp.com> Acked-by: James Morris <jmorris@namei.org> Signed-off-by: David S. Miller <davem@davemloft.net>