| {{Technical|reason=I am a very intelligent native speaker of the English language, and the lead section of this article is completely incomprehensible to me. Highly technical language may be appropriate in later sections of an article on a highly technical subject, but the lead section of an article on ANY subject should be comprehensible to a person who is not an expert in the field, without having to follow links to other articles that may be no more comprehensible than it is. This one fails completely.|date=October 2011}}
| {{Infobox data structure
| |
| |name=Skip List
| |
| |type=List
| |
| |invented_by=[[William Pugh|W. Pugh]]
| |
| |invented_year=1989
| |
| |
| |
| |space_avg=O(n)
| |
| |space_worst=O(n log n)<ref name="cs.uwaterloo">http://www.cs.uwaterloo.ca/research/tr/1993/28/root2side.pdf</ref>
| |
| |search_avg=O(log n)
| |
| |search_worst=O(n)<ref name="cs.uwaterloo" />
| |
| |insert_avg=O(log n)
| |
| |insert_worst=O(n)
| |
| |delete_avg=O(log n)
| |
| |delete_worst=O(n)
| |
| }}
| |
| {{Probabilistic}}
| |
| In [[computer science]], a '''skip list''' is a [[data structure]] that allows fast search within an [[ordered sequence]] of elements. Fast search is made possible by maintaining a linked hierarchy of subsequences, each skipping over fewer elements than the previous one. A search starts in the sparsest subsequence and scans it until two consecutive elements have been found, one smaller than and one larger than or equal to the element searched for. Via the linked hierarchy, these two elements link to elements of the next sparsest subsequence, where the search continues, until finally the full sequence is being searched. The elements that are skipped over may be chosen probabilistically.<ref name="pugh">{{cite journal | url=ftp://ftp.cs.umd.edu/pub/skipLists/skiplists.pdf | title=Skip lists: a probabilistic alternative to balanced trees | last=Pugh | first=William | journal=Communications of the ACM |date=June 1990 | volume=33 | issue=6 | pages=668–676 | doi=10.1145/78973.78977}}</ref><ref>[http://www.ic.unicamp.br/~celio/peer2peer/skip-net-graph/deterministic-skip-lists-munro.pdf Deterministic skip lists]</ref>
| |
| | |
| [[Image:Skip list.svg|center]]
| |
| | |
== Description ==
| | |
| A skip list is built in layers. The bottom layer is an ordinary ordered [[linked list]]. Each higher layer acts as an "express lane" for the lists below, where an element in layer ''i'' appears in layer ''i''+1 with some fixed probability ''p'' (two commonly used values for ''p'' are 1/2 and 1/4). On average, each element appears in 1/(1-''p'') lists, and the tallest element (usually a special head element at the front of the skip list) appears in about <math>\log_{1/p} n\,</math> lists.
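
The two averages quoted above can be checked empirically. The following is a minimal sketch, not part of any particular implementation: the names <code>random_level</code> and <code>simulate</code>, the cap <code>max_level=32</code> and the fixed seed are assumptions made for illustration only.

```python
import math
import random

def random_level(p, rng, max_level=32):
    """Return the number of lists an element appears in (at least 1).

    Each additional list is granted with probability p; max_level is an
    assumed cap that keeps the loop bounded.
    """
    level = 1
    while level < max_level and rng.random() < p:
        level += 1
    return level

def simulate(n, p, seed=0):
    """Draw n levels and return (average level, tallest level)."""
    rng = random.Random(seed)
    levels = [random_level(p, rng) for _ in range(n)]
    return sum(levels) / n, max(levels)

n, p = 100_000, 0.5
mean_level, tallest = simulate(n, p)
print("average lists per element:", mean_level, "predicted:", 1 / (1 - p))
print("tallest element:", tallest, "log_{1/p} n:", math.log(n, 1 / p))
```

The average should land close to 1/(1-''p''), while the tallest element only tracks <math>\log_{1/p} n</math> approximately, since it is the maximum of many random draws.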
| |
| | |
| A search for a target element begins at the head element in the top list, and proceeds horizontally until the current element is greater than or equal to the target. If the current element is equal to the target, it has been found. If the current element is greater than the target, or the search reaches the end of the linked list, the procedure is repeated after returning to the previous element and dropping down vertically to the next lower list. The expected number of steps in each linked list is at most 1/''p'', which can be seen by tracing the search path backwards from the target until reaching an element that appears in the next higher list or reaching the beginning of the current list. Therefore, the total ''expected'' cost of a search is <math>(\log_{1/p} n)/p,\,</math> which is <math>\mathcal{O}(\log n)\,</math> when ''p'' is a constant. By choosing different values of ''p'', it is possible to trade search costs against storage costs.
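
The trade-off between search cost and storage can be made concrete by evaluating the two formulas above for a few values of ''p''. This is only an arithmetic sketch; the function names and the choice of ''n'' are assumptions for illustration.

```python
import math

def expected_search_cost(n, p):
    """Upper bound on expected search steps: (log_{1/p} n) / p."""
    return math.log(n, 1 / p) / p

def expected_pointers_per_element(p):
    """Average number of lists an element appears in: 1 / (1 - p)."""
    return 1 / (1 - p)

n = 1_000_000
for p in (1 / 2, 1 / 4, 1 / 8):
    print(
        f"p={p:<6} search bound={expected_search_cost(n, p):6.1f} "
        f"pointers per element={expected_pointers_per_element(p):.3f}"
    )
```

Under these formulas, ''p'' = 1/2 and ''p'' = 1/4 give the same search bound for every ''n'', because <math>(\log_4 n)/(1/4) = 2\log_2 n = (\log_2 n)/(1/2)</math>, while ''p'' = 1/4 needs fewer pointers per element on average.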
| |
| | |
| === Implementation details ===
| |
| [[File:Skip list add element-en.gif|thumb|Skip list add element-en|500px|Inserting elements to skip list]]
| |
| The elements used for a skip list can contain more than one pointer since they can participate in more than one list.
| |
| | |
| Insertions and deletions are implemented much like the corresponding linked-list operations, except that "tall" elements must be inserted into or deleted from more than one linked list.
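
The search, insertion and deletion procedures described above can be sketched as follows. This is an illustrative sketch under stated assumptions rather than a reference implementation: the class and method names, the cap <code>max_level=16</code> and the look-ahead style of search (comparing against the next element instead of stepping back) are choices made here for brevity.

```python
import random

class Node:
    def __init__(self, key, height):
        self.key = key
        self.forward = [None] * height  # forward[i]: next node in list i

class SkipList:
    def __init__(self, p=0.5, max_level=16, seed=None):
        self.p = p
        self.max_level = max_level          # assumed cap on element height
        self.level = 1                      # number of lists currently in use
        self.head = Node(None, max_level)   # head takes part in every list
        self.rng = random.Random(seed)

    def _random_height(self):
        height = 1
        while height < self.max_level and self.rng.random() < self.p:
            height += 1
        return height

    def _predecessors(self, key):
        """For each list, find the last node whose key is smaller than key."""
        update = [self.head] * self.max_level
        node = self.head
        for i in reversed(range(self.level)):   # top list first
            while node.forward[i] is not None and node.forward[i].key < key:
                node = node.forward[i]          # move right
            update[i] = node                    # then drop down one list
        return update

    def contains(self, key):
        node = self._predecessors(key)[0].forward[0]
        return node is not None and node.key == key

    def insert(self, key):
        update = self._predecessors(key)
        height = self._random_height()
        self.level = max(self.level, height)
        node = Node(key, height)
        for i in range(height):                 # splice into each list
            node.forward[i] = update[i].forward[i]
            update[i].forward[i] = node

    def delete(self, key):
        update = self._predecessors(key)
        node = update[0].forward[0]
        if node is None or node.key != key:
            return False
        for i in range(len(node.forward)):      # unlink from each list
            update[i].forward[i] = node.forward[i]
        return True                             # self.level is not shrunk

sl = SkipList(seed=42)
for k in [5, 1, 9, 3, 7]:
    sl.insert(k)
print(sl.contains(7), sl.contains(4))
```

A "tall" element is handled by the loops over <code>range(height)</code> and <code>range(len(node.forward))</code>, which repeat the ordinary linked-list splice or unlink once per list the element belongs to.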
| |
| | |
| <math>\mathcal{O}(n)</math> operations that visit every node in ascending order (such as printing the entire list) provide an opportunity to derandomize the level structure of the skip list behind the scenes in an optimal way, bringing the search time to a guaranteed <math>\mathcal{O}(\log n)</math>. (Choose the level of the ''i''th finite node to be 1 plus the number of times ''i'' can be divided by 2 before it becomes odd. Also, ''i'' = 0 for the negative-infinity header, following the usual special case of choosing the highest possible level for negative and/or positive infinite nodes.) However, this also allows someone to know where all of the nodes above level 1 are and to delete them.
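
The level rule in parentheses above can be written out directly. This is a small sketch; the function name <code>derandomized_level</code> is an assumption, and the header node (''i'' = 0) is excluded because it is given the highest level separately.

```python
def derandomized_level(i):
    """Level of the i-th finite node (i >= 1).

    The level is 1 plus the number of times i can be halved before it
    becomes odd, i.e. 1 plus the number of trailing zero bits of i.
    """
    if i < 1:
        raise ValueError("finite nodes are numbered from 1")
    level = 1
    while i % 2 == 0:
        i //= 2
        level += 1
    return level

print([derandomized_level(i) for i in range(1, 17)])
# → [1, 2, 1, 3, 1, 2, 1, 4, 1, 2, 1, 3, 1, 2, 1, 5]
```

Every second node reaches level 2, every fourth node level 3, and so on, which is the perfectly balanced layout that the randomized version only approximates on average.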
| |
| | |
| Alternatively, we could make the level structure quasi-random in the following way:
| |
| | |
|  make all nodes level 1
|  j ← 1
|  '''while''' the number of nodes at level j > 1 '''do'''
|    '''for''' each i'th node at level j '''do'''
|      '''if''' i is odd
|        '''if''' i is not the last node at level j
|          randomly choose whether to promote it to level j+1
|        '''else'''
|          do not promote
|        '''end if'''
|      '''else if''' i is even and node i-1 was not promoted
|        promote it to level j+1
|      '''end if'''
|    '''repeat'''
|    j ← j + 1
|  '''repeat'''
| |
| | |
| Like the derandomized version, quasi-randomization is only done when there is some other reason to be running an <math>\mathcal{O}(n)</math> operation (which visits every node).
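
The quasi-random promotion pass described above can be sketched in Python as follows. The sketch works on node positions only and returns a list of levels; the function name <code>quasi_random_levels</code> and the list-based representation are assumptions made for illustration, not part of the original description.

```python
import random

def quasi_random_levels(n, rng=None):
    """Assign a level to each of n nodes (given in ascending key order)."""
    if rng is None:
        rng = random.Random()
    levels = [1] * n                 # make all nodes level 1
    current = list(range(n))         # positions of the nodes at level j
    j = 1
    while len(current) > 1:
        promoted = []
        previous_promoted = False
        for i, node in enumerate(current, start=1):
            if i % 2 == 1:
                # Odd node: flip a coin, unless it is the last node at level j.
                promote = i != len(current) and rng.random() < 0.5
            else:
                # Even node: promoted exactly when its odd partner was not.
                promote = not previous_promoted
            if promote:
                levels[node] = j + 1
                promoted.append(node)
            previous_promoted = promote
        current = promoted
        j += 1
    return levels

levels = quasi_random_levels(100, random.Random(0))
print([sum(level >= k for level in levels) for k in range(1, 8)])
```

Because exactly one node of every odd-even pair is promoted, the number of nodes halves (rounding down) from one level to the next regardless of the coin flips, which is why the search cost stays logarithmic; only the choice within each pair is random.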
| |
| | |
| The advantage of this quasi-randomness is that it does not give away nearly as much information about the level structure to an [[Adversary (online algorithm)|adversarial user]] as the derandomized version does. This is desirable because an adversarial user who is able to tell which nodes are not at the lowest level can pessimize performance by simply deleting higher-level nodes. The search performance is still guaranteed to be logarithmic.
| |
| | |
| It would be tempting to make the following "optimization": in the step that begins "'''for''' each i'th node at level j", do not flip a coin for each odd-even pair. Instead, flip a coin once per level to decide whether to promote only the even nodes or only the odd ones. Instead of <math>\mathcal{O}(n)</math> coin flips, there would only be <math>\mathcal{O}(\log n)</math> of them. Unfortunately, this gives the adversarial user a 50/50 chance of being correct when guessing that all of the even-numbered nodes (among the ones at level 1 or higher) are above level 1. This is despite the property that the adversary has a very low probability of correctly guessing that a particular node is at level ''N'' for some integer ''N''.
| |
| | |
| A skip list does not provide the same absolute worst-case performance guarantees as more traditional [[balanced tree]] data structures, because it is always possible (though with very low probability) that the coin flips used to build the skip list will produce a badly balanced structure. However, skip lists work well in practice, and the randomized balancing scheme has been argued to be easier to implement than the deterministic balancing schemes used in balanced binary search trees. Skip lists are also useful in [[parallel computing]], where insertions can be done in different parts of the skip list in parallel without any global rebalancing of the data structure. Such parallelism can be especially advantageous for resource discovery in an ad-hoc [[wireless network]] because a randomized skip list can be made robust to the loss of any single node.<ref>{{cite paper | last=Shah | first=Gauri Ph.D. | coauthors=James Aspnes | title=Distributed Data Structures for Peer-to-Peer Systems | date=December 2003 | url=http://www.cs.yale.edu/homes/shah/pubs/thesis.pdf | format=PDF | accessdate=2008-09-23}}</ref>
| |
| | |
| There has been some evidence that skip lists have worse real-world performance and space requirements than [[B tree]]s due to [[memory locality]] and other issues.<ref>http://resnet.uoregon.edu/~gurney_j/jmpc/skiplist.html</ref>
| |
| | |
| === Indexable skiplist ===
| |
| | |
| As described above, a skip list is capable of fast <math>\mathcal{O}(\log n)</math> insertion and removal of values from a sorted sequence, but it has only slow <math>\mathcal{O}(n)</math> lookups of values at a given position in the sequence (e.g. returning the 500th value); however, with a minor modification, the speed of [[random access]] indexed lookups can be improved to <math>\mathcal{O}(\log n)</math>.
| |
| | |
| For every link, also store the width of the link. The width is defined as the number of bottom layer links being traversed by each of the higher layer "express lane" links.
| |
| | |
| For example, here are the widths of the links in the example at the top of the page:
| |
| | |
|    1                                10
|  o---> o---------------------------------------------------------> o    Top level
|    1           3              2                    5
|  o---> o---------------> o---------> o---------------------------> o    Level 3
|    1        2        1        2                    5
|  o---> o---------> o---> o---------> o---------------------------> o    Level 2
|    1     1     1     1     1     1     1     1     1     1     1
|  o---> o---> o---> o---> o---> o---> o---> o---> o---> o---> o---> o    Bottom level
|  Head  1st   2nd   3rd   4th   5th   6th   7th   8th   9th   10th  NIL
|        Node  Node  Node  Node  Node  Node  Node  Node  Node  Node
| |
| | |
| Notice that the width of a higher-level link is the sum of the widths of the component links below it (e.g. the width 10 link spans the links of widths 3, 2 and 5 immediately below it). Consequently, the sum of all widths is the same on every level (10 + 1 = 1 + 3 + 2 + 5 = 1 + 2 + 1 + 2 + 5 = 11).
| |
| | |
| To index the skiplist and find the i'th value, traverse the skiplist while counting down the widths of each traversed link. Descend a level whenever the upcoming width would be too large.
| | |
| For example, to find the node in the fifth position (Node 5), traverse a link of width 1 at the top level. Now four more steps are needed but the next width on this level is ten which is too large, so drop one level. Traverse one link of width 3. Since another step of width 2 would be too far, drop down to the bottom level. Now traverse the final link of width 1 to reach the target running total of 5 (1+3+1).
| |
|
| |
|  '''function''' lookupByPositionIndex(i)
|      node ← head
|      i ← i + 1                           ''# don't count the head as a step''
|      '''for''' level '''from''' top '''to''' bottom '''do'''
|          '''while''' i ≥ node.width[level] '''do''' ''# if next step is not too far''
|              i ← i - node.width[level]  ''# subtract the current width''
|              node ← node.next[level]    ''# traverse forward at the current level''
|          '''repeat'''
|      '''repeat'''
|      '''return''' node.value
|  '''end function'''
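
The positional lookup can be tried out on the ten-node example from the diagram. The following Python sketch is illustrative only: the helper <code>build</code>, the node heights, the stored values 10 to 100 and the extra check for the end of a list are assumptions added here so that the example is self-contained.

```python
class Node:
    def __init__(self, value, height):
        self.value = value
        self.next = [None] * height   # next[level]: successor at that level
        self.width = [0] * height     # width[level]: bottom links spanned

def build(values, heights, top):
    """Link the nodes level by level and derive every link's width."""
    head, nil = Node(None, top), Node(None, top)
    nodes = [head] + [Node(v, h) for v, h in zip(values, heights)] + [nil]
    for level in range(top):
        prev = 0
        for pos in range(1, len(nodes)):
            if len(nodes[pos].next) > level:          # node reaches this level
                nodes[prev].next[level] = nodes[pos]
                nodes[prev].width[level] = pos - prev
                prev = pos
    return head

def lookup_by_position(head, i):
    """Return the value at 0-based position i, or None past the end."""
    node = head
    i += 1                                            # don't count the head
    for level in reversed(range(len(head.next))):     # top to bottom
        # The "is not None" guard stops at NIL; it is an addition here.
        while node.next[level] is not None and i >= node.width[level]:
            i -= node.width[level]                    # subtract current width
            node = node.next[level]                   # traverse forward
    return node.value

values = [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]
heights = [4, 1, 2, 3, 1, 3, 1, 1, 1, 1]              # as in the diagram
head = build(values, heights, top=4)
print(lookup_by_position(head, 4))  # fifth node → 50
```

For position 4 the traversal follows the links of width 1, 3 and 1, matching the worked example above.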
| |
| | |
| This method of implementing indexing is detailed in [http://cg.scs.carleton.ca/~morin/teaching/5408/refs/p90b.pdf Section 3.4 Linear List Operations in "A skip list cookbook" by William Pugh].
| |
| | |
| ==History==
| |
| | |
| Skip lists were first described in 1990 by [[William Pugh]].<ref name="pugh"/>
| |
| | |
| To quote the author:
| |
| | |
| :''Skip lists are a probabilistic data structure that seem likely to supplant balanced trees as the implementation method of choice for many applications. Skip list algorithms have the same asymptotic expected time bounds as balanced trees and are simpler, faster and use less space.''
| |
| | |
| ==Usages==
| |
| List of applications and frameworks that use skip lists:
| |
| *[[Cyrus IMAP server]] offers a "skiplist" backend DB implementation ([http://git.cyrusimap.org/cyrus-imapd/tree/lib/cyrusdb_skiplist.c source file])
| |
| *[[Lucene]] uses skip lists to search delta-encoded posting lists in logarithmic time.
| |
| *[http://qt-project.org/doc/qt-4.8/qmap.html#details QMap] (up to Qt 4) template class of [[Qt (framework)|Qt]] that provides a dictionary.
| |
| *[[Redis]], an ANSI-C open-source persistent key/value store for Posix systems, uses skip lists in its implementation of ordered sets.<ref>{{cite web | title=Redis ordered set implementation | url=https://github.com/antirez/redis/blob/unstable/src/t_zset.c}}</ref>
| |
| *[https://github.com/shuttler/nessDB nessDB], a very fast key-value embedded Database Storage Engine (Using log-structured-merge (LSM) trees), uses skip lists for its memtable.
| |
| *[http://www.dekorte.com/projects/opensource/skipdb/ skipdb] is an open-source database format using ordered key/value pairs.
| |
| * [http://download.oracle.com/javase/6/docs/api/java/util/concurrent/ConcurrentSkipListSet.html ConcurrentSkipListSet] and [http://download.oracle.com/javase/6/docs/api/java/util/concurrent/ConcurrentSkipListMap.html ConcurrentSkipListMap] in the Java 1.6 API.
| |
| *[https://code.google.com/p/leveldb/ leveldb], a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values
| |
| * [http://code.activestate.com/recipes/576930/ Skip lists are used for efficient statistical computations] of [[Moving average#Moving median|running medians]] (also known as moving medians).
| |
| Skip lists are also used in distributed applications (where the nodes represent physical computers, and pointers represent network connections) and for implementing highly scalable concurrent priority queues with less lock contention,<ref>[http://dx.doi.org/10.1109/IPDPS.2000.845994 Skiplist-based concurrent priority queues]</ref> or even without locking,<ref>{{cite doi|10.1109/IPDPS.2003.1213189}}</ref><ref>{{cite doi|10.1145/1011767.1011776}}</ref><ref>{{cite doi|10.1109/ISPA.2008.90}}</ref> as well as lockless concurrent dictionaries.<ref>{{cite doi|10.1145/967900.968188}}</ref> There are also several US patents for using skip lists to implement (lockless) priority queues and concurrent dictionaries.{{citation needed|date=October 2011}}
| |
| | |
| ==See also==
| |
| *[[Bloom filter]]
| |
| *[[Skip graph]]
| |
| * Skip trees, an alternative data structure to Skip lists in a concurrent approach: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.47.514
| |
| * Skip tree graphs: http://www0.cs.ucl.ac.uk/staff/a.gonzalezbeltran/pubs/icc2007.pdf, http://www0.cs.ucl.ac.uk/staff/a.gonzalezbeltran/pubs/AGB-comcom08.pdf
| |
| | |
| ==References==
| |
| <references/>
| |
| | |
| ==External links==
| |
| *[http://nist.gov/dads/HTML/skiplist.html "Skip list" entry] in the [[Dictionary of Algorithms and Data Structures]]
| |
| *[http://msdn.microsoft.com/en-us/library/ms379573(VS.80).aspx#datastructures20_4_topic4 Skip Lists: A Linked List with Self-Balancing BST-Like Properties] on [[MSDN]] in C# 2.0
| |
| *[http://dekorte.com/projects/opensource/SkipDB/ SkipDB, a BerkeleyDB-style database implemented using skip lists.]
| |
| *[http://videolectures.net/mit6046jf05_demaine_lec12/ Skip Lists lecture (MIT OpenCourseWare: Introduction to Algorithms) ]
| |
| *[http://opendatastructures.org/versions/edition-0.1e/ods-java/4_Skiplists.html Open Data Structures - Chapter 4 - Skiplists]
| |
| | |
| ;Demo applets
| |
| *[http://people.ksp.sk/~kuko/bak/index.html Skip List Applet] by Kubo Kovac
| |
| *[http://iamwww.unibe.ch/~wenger/DA/SkipList/ Thomas Wenger's demo applet on skiplists]
| |
| ;Implementations
| |
| *[http://codingplayground.blogspot.com/2009/01/generic-skip-list-skiplist.html A generic Skip List in C++] by Antonio Gulli
| |
| *[https://metacpan.org/module/Algorithm::SkipList Algorithm::SkipList, implementation in Perl on CPAN]
| |
| *[http://infohost.nmt.edu/tcc/help/lang/python/examples/pyskip/ John Shipman's implementation in Python]
| |
| *[http://code.activestate.com/recipes/576930/ Raymond Hettinger's implementation in Python]
| |
| *[http://love2d.org/wiki/Skip_list A Lua port of John Shipman's Python version]
| |
| *[https://gist.github.com/dmx2010/5426422 Java Implementation with index based access]
| |
| *[http://java.sun.com/javase/6/docs/api/java/util/concurrent/ConcurrentSkipListSet.html ConcurrentSkipListSet documentation for Java 6] (and [http://www.docjar.com/html/api/java/util/concurrent/ConcurrentSkipListSet.java.html sourcecode])
| |
| | |
| {{Data structures}}
| |
| | |
| {{DEFAULTSORT:Skip List}}
| |
| [[Category:1989 introductions]]
| |
| [[Category:Linked lists]]
| |
| [[Category:Probabilistic data structures]]
| |
| | |
| [[de:Liste (Datenstruktur)#Skip-Liste]]
| |