您的位置:首页 > 理论基础 > 计算机网络

C/C++网络编程中的TCP保活(转载)

2012-03-30 11:41 190 查看
原帖地址:http://blog.csdn.net/weiqubo/article/details/7225338

在默认的情况下,TCP连接是没有保活的心跳的。这就是说,当一个TCP的socket,客户端与服务端谁也不发送数据,会一直保持着连接。这其中如果有一方异常掉线,另一端永远也不可能知道。这对于一些服务型的程序来说,将是灾难性的后果。

  所以,必须对创建的socket,启用保活心跳,即Keepalive选项。


启用Keepalive

  对于WIN32或者Linux平台来说,设置socket的Keepalive都很简单,只需使用setsockopt设置SO_KEEPALIVE即可。

  setsockopt的函数原型在Linux环境下为:

[html]
view plaincopyprint?

#include <sys/types.h>
#include <sys/socket.h>

int setsockopt(int s, int level, int optname,
const void *optval,
socklen_t optlen);

[html]
view plaincopyprint?

#include <winsock2.h>

int setsockopt(int s, int level, int optname,
const char *optval,
int optlen);

#include <winsock2.h>

int setsockopt(int s, int level, int optname,
const char *optval,
int optlen);


  因为const void *可以接受const char *型的参数,所以为了代码的跨平台编译考虑,可以采用以下代码来设置TCP的Keepalive选项。

[html]
view plaincopyprint?

alive = 1;
if (setsockopt
(fd, SOL_SOCKET, SO_KEEPALIVE, (const char *) &alive,
sizeof alive) != 0)
{
log_warn ("Set keep alive error: %s.\n", strerror (errno));
return -1;
}

  为什么说是系统默认值的呢?因为有这样几个值,我们并没有手动设置,是采用的系统默认值。即,

多长时间发送一次保活心跳?
如果没有返回,多长时间再重试发送?
重试几次为失败?
  如果是Linux操作系统,这三个值分别为

[html]
view plaincopyprint?

# cat /proc/sys/net/ipv4/tcp_keepalive_time
7200
# cat /proc/sys/net/ipv4/tcp_keepalive_intvl
75
# cat /proc/sys/net/ipv4/tcp_keepalive_probes
9

# cat /proc/sys/net/ipv4/tcp_keepalive_time
7200
# cat /proc/sys/net/ipv4/tcp_keepalive_intvl
75
# cat /proc/sys/net/ipv4/tcp_keepalive_probes
9


  这就是说,在Linux系统下,如果对于TCP的socket启用了Keepalive选项,则会在7200秒(即两个小时)没有数据后,发起KEEPALIVE报文。如果没有回应,则会在75秒后再次重试。如果重试9次均失败,则认定连接已经失效。TCP的读取操作,将返回0。

  这对于我们大多数应用来说,前两个时间值都有点太长了。

  我们可以通过重设上面三个值,来使得操作系统上运行的所有启用了Keepalive选项的TCP的socket的行为更改。

  我们也可以只针对我们自己创建的socket,重设这三个值。它们分别对应TCP_KEEPIDLE、TCP_KEEPINTL和TCP_KEEPCNT的选项值,同样可以使用setsockopt进行设置。

[html]
view plaincopyprint?

#include <stdlib.h>
#include <fcntl.h>
#include <errno.h>
#include <sys/socket.h>
#include <netinet/tcp.h>
#include <netinet/in.h>
#include <netdb.h>
#include <arpa/inet.h>

int
socket_set_keepalive (int fd)
{
int ret, error, flag, alive, idle, cnt, intv;

/* Set: use keepalive on fd */
alive = 1;
if (setsockopt
(fd, SOL_SOCKET, SO_KEEPALIVE, &alive,
sizeof alive) != 0)
{
log_warn ("Set keepalive error: %s.\n", strerror (errno));
return -1;
}

/* 10秒钟无数据,触发保活机制,发送保活包 */
idle = 10;
if (setsockopt (fd, SOL_TCP, TCP_KEEPIDLE, &idle, sizeof idle) != 0)
{
log_warn ("Set keepalive idle error: %s.\n", strerror (errno));
return -1;
}

/* 如果没有收到回应,则5秒钟后重发保活包 */
intv = 5;
if (setsockopt (fd, SOL_TCP, TCP_KEEPINTVL, &intv, sizeof intv) != 0)
{
log_warn ("Set keepalive intv error: %s.\n", strerror (errno));
return -1;
}

/* 连续3次没收到保活包,视为连接失效 */
cnt = 3;
if (setsockopt (fd, SOL_TCP, TCP_KEEPCNT, &cnt, sizeof cnt) != 0)
{
log_warn ("Set keepalive cnt error: %s.\n", strerror (errno));
return -1;
}

return 0;
}

  而WIN32环境下的参数设置,就要麻烦一些,需要使用另外的一个函数WSAIoctl和一个结构struct tcp_keepalive。

  它们的原型分别为:

[html]
view plaincopyprint?

#include <winsock2.h>
#include <mstcpip.h>

int WSAIoctl(
SOCKET s,
DWORD dwIoControlCode,
LPVOID lpvInBuffer,
DWORD cbInBuffer,
LPVOID lpvOutBuffer,
DWORD cbOutBuffer,
LPDWORD lpcbBytesReturned,
LPWSAOVERLAPPED lpOverlapped,
LPWSAOVERLAPPED_COMPLETION lpCompletionRoutine
);

struct tcp_keepalive {
u_long onoff;
u_long keepalivetime;
u_long keepaliveinterval;
};

#include <winsock2.h>
#include <mstcpip.h>

int WSAIoctl(
SOCKET s,
DWORD dwIoControlCode,
LPVOID lpvInBuffer,
DWORD cbInBuffer,
LPVOID lpvOutBuffer,
DWORD cbOutBuffer,
LPDWORD lpcbBytesReturned,
LPWSAOVERLAPPED lpOverlapped,
LPWSAOVERLAPPED_COMPLETION lpCompletionRoutine
);

struct tcp_keepalive {
u_long onoff;
u_long keepalivetime;
u_long keepaliveinterval;
};
  在这里,使用WSAIoctl的时候,dwIoControlCode要使用SIO_KEEPALIVE_VALS,lpvOutBuffer用不上,cbOutBuffer必须设置为0。

  struct tcp_keepalive结构的参数意义为:

  onoff,是否开启KEEPALIVE; keepalivetime,多长时间触发Keepalive报文的发送; keepaliveinterval,多长时间没有回应触发下一次发送。

  注意:这里两个时间单位都是毫秒而不是秒。

[html]
view plaincopyprint?

#include <winsock2.h>
#include <mstcpip.h>

int
socket_set_keepalive (int fd)
{
struct tcp_keepalive kavars[1] = {
1,
10 * 1000, /* 10 seconds */
5 * 1000 /* 5 seconds */
};

/* Set: use keepalive on fd */
alive = 1;
if (setsockopt
(fd, SOL_SOCKET, SO_KEEPALIVE, (const char *) &alive,
sizeof alive) != 0)
{
log_warn ("Set keep alive error: %s.\n", strerror (errno));
return -1;
}

if (WSAIoctl
(fd, SIO_KEEPALIVE_VALS, kavars, sizeof kavars, NULL, sizeof (int), &ret, NULL,
NULL) != 0)
{
log_warn ("Set keep alive error: %s.\n", strerror (WSAGetLastError ()));
return -1;
}

return 0;
}

#include <winsock2.h>
#include <mstcpip.h>

int
socket_set_keepalive (int fd)
{
struct tcp_keepalive kavars[1] = {
1,
10 * 1000, /* 10 seconds */
5 * 1000 /* 5 seconds */
};

/* Set: use keepalive on fd */
alive = 1;
if (setsockopt
(fd, SOL_SOCKET, SO_KEEPALIVE, (const char *) &alive,
sizeof alive) != 0)
{
log_warn ("Set keep alive error: %s.\n", strerror (errno));
return -1;
}

if (WSAIoctl
(fd, SIO_KEEPALIVE_VALS, kavars, sizeof kavars, NULL, sizeof (int), &ret, NULL,
NULL) != 0)
{
log_warn ("Set keep alive error: %s.\n", strerror (WSAGetLastError ()));
return -1;
}

return 0;
}
内容来自用户分享和网络整理,不保证内容的准确性,如有侵权内容,可联系管理员处理 点击这里给我发消息
标签: