ImageVerifierCode 换一换
格式:PPT , 页数:31 ,大小:2.63MB ,
资源ID:1210250      下载积分:5 金币
快捷下载
登录下载
邮箱/手机:
温馨提示:
快捷下载时,用户名和密码都是您填写的邮箱或者手机号,方便查询和重复下载(系统自动生成)。 如填写123,账号就是123,密码也是123。
特别说明:
请自助下载,系统不会自动发送文件的哦; 如果您已付费,想二次下载,请登录后访问:我的下载记录
支付方式: 支付宝    微信支付   
验证码:   换一换

加入VIP,免费下载
 

温馨提示:由于个人手机设置不同,如果发现不能下载,请复制以下地址【https://www.docduoduo.com/d-1210250.html】到电脑端继续下载(重复下载不扣费)。

已注册用户请登录:
账号:
密码:
验证码:   换一换
  忘记密码?
三方登录: 微信登录   QQ登录   微博登录 

下载须知

1: 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。
2: 试题试卷类文档,如果标题没有明确说明有答案则都视为没有答案,请知晓。
3: 文件的所有权益归上传用户所有。
4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
5. 本站仅提供交流平台,并不能对任何下载内容负责。
6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

版权提示 | 免责声明

本文(2011.11.04-稀疏表示与图像视频标注.ppt)为本站会员(依依)主动上传,道客多多仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。 若此文所含内容侵犯了您的版权或隐私,请立即通知道客多多(发送邮件至docduoduo@163.com或直接QQ联系客服),我们立即给予删除!

2011.11.04-稀疏表示与图像视频标注.ppt

1、稀疏表示与图像视频标注,lihaojieyilei2011年11月4日,1,Sparsity,A signal is sparse if most of its coefficients are (approximately) zero,1. Nonparametric Label-to-Region by SearchXiaobai Liu, Shuicheng Yan, Jiebo Luo, Jinhui Tang, Zhongyang Huang and Hai JinCVPR 20102. Sparse Ensemble Learning for Concept DetectionSh

2、eng Tang, Yan-Tao Zheng, Yu Wang, Tat-Seng ChuaIEEE Trans. on Multimedia, 2012,4,Nonparametric Label-to-Region by Search,Label-to-Region, L2Rpropagate annotated labels for a given single image from the image-level to their corresponding semantic regions,5,L2R,In CV, known as simultaneous object reco

3、gnition and image segmentationUnsupervised learning methodsobject localization: image segmentation along with object classification multi-label image segmentation and classificationcan only handle images either with single major object or with clean background and without occlusions between objects

4、Supervised learning methods, i.e., classifier-based methods, which usually first learn image classifiers to characterize concepts (or keywords) based on the training images, and then identify the images belonging to the specific category E.g., CMRM,6,L2R by Search-Overview,Each label of the image is

5、 used as query for online image search engines to obtain a set of semantically related and visually similar imagesSegment both input image and online images returned from image search engine into local atomic image patches to obtain the so-called bag-of-patches (BOP) representationA label-specific f

6、eature mining procedure is employed for each label to discover distinctive and descriptive features from the proposed Interpolation SIFT (iSIFT) feature pool. These features are used to discover the patch level label-specific representations,7,L2R by Search-Overview,Construct the candidate regions T

7、he continuity-biased sparsity prior is introduced to select a small number of patches from the online images with preference to larger patchesUse a sparse coding formulation to construct. The candidate regions are further ranked based on the reconstruction errors and the top ones are used to derive

8、the label confidence vector for each atomic patch of the input image.A patch clustering procedure is performed on the input image as a post-processing step to obtain the ultimate L2R assignments,8,L2R by Search-Advantages,1. the sparsity and continuity-biased priors are used to ensure the reliabilit

9、y of label assignment, 2. it does not require exact image parsing, which remains an open problem for real world images,3. no generative or discriminative models need be learned for each label, and thus it is extremely scalable for applications with large-scale image sets as well as large semantic on

10、tology,9,L2R by Search-Techniques,Image RepresentationLabel-Specific Feature Mining by SearchSparse Region CodingSparse Region Coding with Continuity-PriorLabel Assignment via Sparse RepresentationPatch Clustering,10,Image Representation,1. Bag-of-PatchesGraph-based segmentation Felzenszwalb2004Resi

11、ze all the images into a roughly equal resolution and initialize each pixel as one atomic patchUse color features to describe the appearance of an initial image patch and apply graph algorithm to merge the smaller patches into larger onesThis step iterates until all the image patches are merged into

12、 one single patch, namely the original image,11,Image Representation,2. Interpolation SIFT featuresSIFT is robust to image noises and scale changes, but is sparseTo interpolate some new interest points between the sparse interest points detected by the standard SIFT detector to enhance the image des

13、cription capability.SIFT 2D Delaunay triangulation interpolation,12,Label-Specific Feature Mining by Search,only a part of the vocabulary is descriptive or informative for the corresponding label,13,ObservationsIn order to capture objects or scenes, the visual representations should have the followi

14、ng properties: i) the visual words should appear on the input image, ii)the visual words that are informative for a specific label should appear more frequently than other words in the images containing the label, or they should be less frequent in the images not containing the label, and iii) the d

15、escriptive visual words should be located on the objects or scenes.,Label-Specific Feature Mining by Search,14,Method:2-stage procedure1. remove the words that do not appear in the input image2. mining with a probabilistic inference frameworkVocabulary W = W1,W2, . . . ,WNW . for each label c1) freq

16、uency of each visual word and 2) co-occurrence of each word with other wordsSelect the top 20% of these ranked words as label-specific representation,Label-Specific Feature Mining by Search,15,Label-Specific Feature Mining by Search,16,Sparse Region Coding,To discover the cross-image region/patch co

17、rrespondence via sparse coding is the feature descriptor of the candidate region is the coefficient vector, whose entries are expected to be zeros except for those samples containing the same label as , and is a noise vector which explicitly accounts for the possible sparse noises,17,Sparse Region C

18、oding,18,Sparse Region Coding with Continuity-Prior,The reconstructions of candidate regions are with sparsity prior, which means that we prefer to select as few patches as possible. Since our goal is to discover the cross image correspondence, it is natural to additionally enforce that the matched

19、image patches are perceptually and spatially coherent.This motivation leads to the preference to image patches with larger size, namely the continuity-biased prior.,19,Sparse Region Coding with Continuity-Prior,20,the derived coefficient is both sparse and continuity-biased,21,The top 5 selected ima

20、ge patches ranked according to the reconstruction coefficients using different priors,22,L2R Assignment via Sparse Representation,Given a candidate region y of the input image and the feature basis matrix A, we first compute its sparse representation by solving (5)Then, we classify y based on how we

21、ll the coefficients associated with all image patches of each label reproduce y.,23,24,25,Experiments,To evaluate the effectiveness of iSIFT feature pool, feature mining procedure and the continuity-biased sparse coding formulation for Label-to-Region assignment taskDatasets: MSRC 9, 500 images, 23

22、categories/labels, region-level ground truthsCOREL, 4000 images, 8 labels, region-level annotationsDataset collected by Stephen 20, 715 images, 7 labels, region-level annotationsBaselines:SVMKNN,26,Experiments,Dense sift: one sift each lattice of 10*10 pixelsSVM-I: svm + dense SiftSVM-II: svm + iSiftKNN-I: knn + dense SiftKNN-II: knn + iSiftLAS-A-I: LAS + dense Sift + sparse priorLAS-A-II: LAS+iSift + sparse priorLAS-B-II: LAS + iSift + sparse prior + continuity-biased priorBing, Google,27,Experiments,28,29,Dicussions,Image with noisy labelsVideo tag localization,30,Thanks & QA,31,

本站链接:文库   一言   我酷   合作


客服QQ:2549714901微博号:道客多多官方知乎号:道客多多

经营许可证编号: 粤ICP备2021046453号世界地图

道客多多©版权所有2020-2025营业执照举报