Ukbench图像数据集
2016-01-10 10:24
1396 查看
Ukbench图像数据集官网地址:http://www.vis.uky.edu/~stewe/ukbench/
Stewenius
Revised set!In the first set which went online there were some errors. Most notably one subset being included twice. Also some transposed images. Tests on the old set are invalid.Recognition Benchmark ImagesHenrik Stewénius and David NistérThe set consists of N groups of 4 images each. All the images are 640x480. If you use the dataset, please refer to: D. Nistér and H. Stewénius. Scalable recognition with a vocabulary tree. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 2161-2168, June 2006. [ bib | .ppt | .pdf ] SubsetsFor users of subsets of the database please note that the difficulty is dependent on the chosen subset. Important factors are:Difficulty of the objects themselves. CD-covers are much easier than flowers. See performance curve below. Sharpness of the images. Many of the indoor images are somewhat blurry and this can affect some algorithms. Similar or identical objects. All the pictures where taken by CS students/faculty/staff and thus keyboards and computer equipment are popular motives. So is computer vision literature. DownloadPlease note BEFORE starting your download that the file is almost 2GB. Please save a local copy in order to save bandwidth at our server.Zipped File.Visual Words. We extracted visual words for each document and wrote them one document per line. Data before ":" is header and then data. The vocabulary was6 levels and splitting with a factor of 10. The vocabulary was trained on non-related data. PerformanceIn the paper we give results either for a subset of 6376 images (all we had at that time) or a smaller subset of 1400 images. The smaller set was used when we did not have an efficient enough implementation in order to handle the larger set.Performance MeasuresOur simplest measure of performance is to count how many of the 4 images which are top-4 when using a query image from that set of four images.A matlab implementation which computes this measure: Download. Numbers for computing our measure on the full 10200 database using different training-sets and different scoring strategies:
How our performance varies when taking subsets 0:n from the set. The different curces represent different choices in scoring strategy. For extremely fast applications we use the flat-scoring while for better performance we use hierarchical scoring. The feature extractor was set to use relatively few features for these experiments. How the score is computedint nrblocks = nr_docs/4; int totaltopcount = 0; for( block = 0; block < nrblocks; block++) { for( int i=0; i < 4; i++){ int pos = block*4+i; for( int j=0; j < 4; j++){ r = find_rank_of_doc (4*block+j) relative to doc (block*4+i); if( r < 4) totaltopcount++; } } } score = totaltopcount/(nrblocks*4); What we are measuring is how many of the images are found on average.Getting everything right gives a score of 4Getting nothing right gives a score of 0Getting only identical image right gives a score of 1A score of 3 means that we find the identical image plus 2 of the 3 other images of the set. Semiprocessed DataWe have computed lots of semiprocessed data along with SIFT vectors for training.Semiprocessed DataThis page is maintained by Henrik Stewénius |
相关文章推荐
- C#图像处理之霓虹效果实现方法
- C#图像亮度调整的方法
- C#实现图像锐化的方法
- C#图像透明度调整的方法
- C#数字图象处理之图像灰度化方法
- C#图像处理之头发检测的方法
- C#图像处理之图像目标质心检测的方法
- C#实现图像反色的方法
- 从jsp发送动态图像
- C#数字图像处理之图像缩放的方法
- C++将CBitmap类中的图像保存到文件的方法
- C#图像重新着色的方法
- 使用CamanJS在Web页面上处理图像的技巧
- php中将指针移动到数据集初始位置的实现代码[mysql_data_seek]
- PHP图像处理类库MagickWand用法实例分析
- php图像处理类实例
- php对图像的各种处理函数代码小结
- C#控制图像旋转和翻转的方法
- C# Console利用mspaint打开图像并保存的方法
- C#实现在图像中绘制文字图形的方法