您的位置：首页 > 运维架构 > Linux

Linux内核中链表和散列表的实现原理揭秘

2011-07-14 11:37 246 查看

By沈东良(良少)http://blog.csdn.net/shendlLinux内核的实现，大量使用了数据结构，包括了数组、链表和散列表。其中用的最多的是双向循环链表。Linux内核使用的是自己定义的链表和散列表，简单而高效，使用方法也非常的别具一格。研究Linux内核的链表和散列表对于看懂Linux内核源代码有重要的意义。本文基于kernel2.6.39版本进行分析。

Linux的链表和散列表定义在include/linux/types.h文件中

structlist_head{
223structlist_head*next,*prev;
224};
225

226structhlist_head{
227structhlist_node*first;
228};
229

230structhlist_node{
231structhlist_node*next,**pprev;
232};
233

list_head就是使用最为广泛的双向循环链表。这个数据结构可以说是LinuxKernel的基石，大量内核源代码使用了这个数据结构。hlist_head和hlist_node常常用于散列表中。

Linux的链表和散列表的操作函数的定义在include/linux/list.h文件中

初始化双向循环链表，只有一个元素的双向循环链表，next和prev指向自身。staticinlinevoidINIT_LIST_HEAD(structlist_head*list)25{26list->next=list;27list->prev=list;28}29初始化散列表的链表。空的散列表链表的first==NULL。每一个散列表链表的元素初始化时next和pprev指针都是NULL，而不是指向自身。我们可以看到，散列表链表hlist_node虽然和双向循环链表list_head一样，都有两个指针，但有本质的区别。散列表链表hlist_node不是循环链表。它有头和尾，是单向的链表。散列表链表hlist_node之所以有两个指针，是为了提高插入和删除链表的效率。hlist_node的插入，只需要一个相邻的hlist_head或者hlist_node节点即可。它的删除，只需要它本身即可定位其相邻的前后两个元素。570571#defineHLIST_HEAD_INIT{.first=NULL}572#defineHLIST_HEAD(name)structhlist_headname={.first=NULL}573#defineINIT_HLIST_HEAD(ptr)((ptr)->first=NULL)574staticinlinevoidINIT_HLIST_NODE(structhlist_node*h)575{576h->next=NULL;577h->pprev=NULL;578}579

脱离链表的元素的状态

staticinlinevoid__list_add(structlist_head*new,38structlist_head*prev,39structlist_head*next)40{41next->prev=new;42new->next=next;43new->prev=prev;44prev->next=new;45}46/*80*Deletealistentrybymakingtheprev/nextentries81*pointtoeachother.82*83*Thisisonlyforinternallistmanipulationwhereweknow84*theprev/nextentriesalready!85*/86staticinlinevoid__list_del(structlist_head*prev,structlist_head*next)87{88next->prev=prev;89prev->next=next;90}9192/**93*list_del-deletesentryfromlist.94*@entry:theelementtodeletefromthelist.95*Note:list_empty()onentrydoesnotreturntrueafterthis,theentryis96*inanundefinedstate.97*/98#ifndefCONFIG_DEBUG_LIST99staticinlinevoid__list_del_entry(structlist_head*entry)100{101__list_del(entry->prev,entry->next);102}103104staticinlinevoidlist_del(structlist_head*entry)105{106__list_del(entry->prev,entry->next);107entry->next=LIST_POISON1;108entry->prev=LIST_POISON2;109}110#else散列表链表的脱离链表代码：90staticinlinevoid__hlist_del(structhlist_node*n)591{592structhlist_node*next=n->next;593structhlist_node**pprev=n->pprev;594*pprev=next;595if(next)596next->pprev=pprev;597}598599staticinlinevoidhlist_del(structhlist_node*n)600{601__hlist_del(n);602n->next=LIST_POISON1;603n->pprev=LIST_POISON2;604}605看看LIST_POISON1和LIST_POISON2是何方神圣。1617/*18*Thesearenon-NULLpointersthatwillresultinpagefaults19*undernormalcircumstances,usedtoverifythatnobodyuses20*non-initializedlistentries.21*/22#defineLIST_POISON1((void*)0x00100100+POISON_POINTER_DELTA)23#defineLIST_POISON2((void*)0x00200200+POISON_POINTER_DELTA)24表示链表元素是未初始化的，既不在链表中，也没有经过初始化，不应该使用。

遍历Linuxkernel的链表时删除元素的方法

在list_head双向循环链表的迭代中删除元素的方法，见我之前写的《遍历Linuxkernel的链表时删除元素的方法》一文，地址：/article/1672140.html散列表链表也有同样的问题：665pos是当前元素666#definehlist_for_each(pos,head)\667for(pos=(head)->first;pos&&({prefetch(pos->next);1;});\668pos=pos->next)669多了一个n元素，保存当前元素的下一个元素，这样就可以避免删除当前元素后，无法继续迭代的问题了。670#definehlist_for_each_safe(pos,n,head)\671for(pos=(head)->first;pos&&({n=pos->next;1;});\672pos=n)673

链表元素的使用方式和container_of宏

LinuxKernel的链表使用方式和其他一般的链表的使用方式大相径庭！一般的链表，总是有一个链表结构体，它的内部有指针指向实际的对象。一般是这样子的：structnode{structnode*next;structnode*prev;void*data;};然后有一些操作函数，负责使用这样的链表元素实现链表。这应该算是非常标准的链表形式了。大量已有的各类语言的链表都是类似这样实现的。再回头看看LinuxKernel的链表定义：structlist_head{223structlist_head*next,*prev;224};225226structhlist_head{227structhlist_node*first;228};229230structhlist_node{231structhlist_node*next,**pprev;232};咦！实际的数据元素呢？这种链表有什么用？一堆指针链接在一起，却没有数据，有个毛用！:-)确实挺奇怪的，放眼全球各类语言，也没有哪一个数据结构库的链表是这样子的！我们看看kernel是怎样使用这些链表的：structinode{736/*RCUpathlookuptouchesfollowing:*/737umode_ti_mode;738uid_ti_uid;739gid_ti_gid;740conststructinode_operations*i_op;741structsuper_block*i_sb;742743spinlock_ti_lock;/*i_blocks,i_bytes,maybei_size*/744unsignedinti_flags;745structmutexi_mutex;746747unsignedlongi_state;

内容来自用户分享和网络整理，不保证内容的准确性，如有侵权内容，可联系管理员处理

标签：

相关文章推荐

新的分享

章节导航

Linux内核中链表和散列表的实现原理揭秘

Linux的链表和散列表定义在include/linux/types.h文件中

Linux的链表和散列表的操作函数的定义在include/linux/list.h文件中

脱离链表的元素的状态

遍历Linuxkernel的链表时删除元素的方法

链表元素的使用方式和container_of宏

谜底在就在container_of宏：

LinuxKernel如何实现散列表

实例Inode-cache

实例Dentrycache