|
|
本帖最后由 wangjian4500 于 2014-9-17 11:41 編輯
時(shí)間:2014年9月20日,下午13:30-17:30
地點(diǎn):清華大學(xué)FIT樓二樓多功能廳
Bridging CNN and Unmatched Visual Tasks
顏水城
新加坡國(guó)立大學(xué)副教授、博導(dǎo)
Abstract: In this talk, we shall introduce two recent works toapply deep learning for those tasks on which deep learning is not so natural.
1. In this work, we propose a flexible deepCNN infrastructure, called Hypotheses-CNN-Pooling (HCP), where an arbitrarynumber of object segment hypotheses are taken as the inputs, then a shared CNNis connected with each hypothesis, and finally the CNN output results fromdifferent hypotheses are aggregated with max pooling to produce the ultimatemulti-label predictions. Experimental results on Pascal VOC2007 and VOC2012multi-label image datasets well demonstrate the superiority of the proposed HCPinfrastructure over other state-of-the-arts. In particular, the mAP reaches84.2% by HCP only and 90.3% after the fusion with our complementary result in[47] based on hand-crafted features on the VOC2012 dataset, which significantlyoutperforms the state-of-the-arts with a large margin of more than 7%.
2. In this work, the human parsing task, namelydecomposing a human image into semantic fashion/body regions, is formulated asan Active Template Regression (ATR) problem, where the normalized mask of eachitem is expressed as the linear combination of the learned mask templates, andthen morphed to a more precise mask with the active shape parameters, includingposition, scale and visibility of each semantic region. More specifically, thestructure outputs are predicted by two separate networks. For a new image, thestructure outputs of the two networks are fused to generate the probability ofeach semantic label for each pixel, and super-pixel smoothing is finally used to fine-tune thehuman parsing result. Comprehensiveevaluations on a new large dataset well demonstrate the significant superiorityof the ATR framework over other state-of-the-arts for human parsing.
Bio: Dr. YanShuicheng is currently an Associate Professor at the Department of Electricaland Computer Engineering at National University of Singapore, and the foundinglead of the Learning and Vision Research Group (http ://w ww.lv-nus.org). Dr.Yan's research areas include machine learning, computer vision and multimedia,and he has authored/co-authored hundreds of technical papers over a wide rangeof research topics, with Google Scholar citation >13,000 times and H-index50. He has been serving as an associate editor of IEEE TKDE, TCSVT and ACMTransactions on Intelligent Systems and Technology (ACM TIST). He received theBest Paper Awards from ACM MM'13 (Best Paper and Best Student Paper), ACM MM’12(Best Demo), PCM'11, ACM MM’10, ICME’10 and ICIMCS'09, the runner-up prize ofILSVRC'13, the winner prize of ILSVRC’14 detection task, the winner prizes ofthe classification task in PASCAL VOC 2010-2012, the winner prize of thesegmentation task in PASCAL VOC 2012, the honorable mention prize of thedetection task in PASCAL VOC'10, 2010 TCSVT Best Associate Editor (BAE) Award,2010 Young Faculty Research Award, 2011 Singapore Young Scientist Award, and2012 NUS Young Researcher Award.
Learning with Parallel Vector Field
何曉飛
浙江大學(xué)教授、博導(dǎo)、國(guó)家杰青
Abstract: In this talk, I will introduce our recentwork on manifold learning from the perspective of vector field. Unlike graphbased techniques which try to preserve the distance, our approach tries to finda constant vector field on the manifold and then reconstruct the embeddingfunction via the obtained vector field. When we restrict the vector field tothe gradient field, our approach is equivalent to finding killing vector fieldon manifold. Our analysis of killing field on Euclidean space shows, when themanifold is locally isometric to a connected subset of Euclidean space, we canalways recover the manifold isometrically. I will also present someexperimental results on both synthetic and real data sets.
Bio:何曉飛,博士,浙江大學(xué)教授、博導(dǎo),國(guó)家杰出青年基金獲得者,IEEE高級(jí)會(huì)員。2000年畢業(yè)于浙江大學(xué),獲計(jì)算機(jī)學(xué)士學(xué)位;2005年畢業(yè)于美國(guó)芝加哥大學(xué),獲計(jì)算機(jī)博士學(xué)位;之后加入美國(guó)雅虎公司,任職研究科學(xué)家;2007年作為人才引進(jìn)加入浙江大學(xué),任職教授;曾獲1999年國(guó)際大學(xué)生數(shù)學(xué)建模競(jìng)賽特等獎(jiǎng)。近年來(lái)主要從事人工智能、互聯(lián)網(wǎng)數(shù)據(jù)挖掘及計(jì)算機(jī)視覺(jué)等方面的研究。論文共被他人引用8000余次,其中兩篇代表性論文分別被他人引用上千次,F(xiàn)/曾任7個(gè)國(guó)際SCI學(xué)術(shù)刊物的編委,包括IEEE TKDE、IEEE TCYB、CVIU等。曾近30次擔(dān)任國(guó)際會(huì)議的大會(huì)主席、副主席及程序委員會(huì)委員。獲得2012年人工智能頂級(jí)國(guó)際會(huì)議AAAI的最佳論文獎(jiǎng),以及2010年多媒體領(lǐng)域國(guó)際頂級(jí)會(huì)議ACM Multimedia的最佳論文提名獎(jiǎng)。
Regressionbased Robust Classification
楊健
南京理工大學(xué)教授、博導(dǎo)、國(guó)家杰青
Bio:楊健,2002年7月博士畢業(yè)于南京理工大學(xué)計(jì)算機(jī)學(xué)院模式識(shí)別專(zhuān)業(yè)。自2003年起,先后在西班牙薩拉戈薩大學(xué)、香港理工大學(xué)、美國(guó)新澤西理工學(xué)院、加州理工學(xué)院從事博士后或訪(fǎng)問(wèn)研究。2007年9月起任南京理工大學(xué)計(jì)算機(jī)學(xué)院教授,2014年3月起任南京理工大學(xué)計(jì)算機(jī)科學(xué)與工程學(xué)院副院長(zhǎng)。
長(zhǎng)期從事模式識(shí)別理論與應(yīng)用方面的研究,先后主持了國(guó)家自然科學(xué)基金面上項(xiàng)目,國(guó)家杰出青年科學(xué)基金項(xiàng)目,教育部科學(xué)技術(shù)研究重大項(xiàng)目,國(guó)家973課題等項(xiàng)目。在模式識(shí)別和機(jī)器智能領(lǐng)域的頂級(jí)國(guó)際期刊 IEEE Trans. on PAMI上發(fā)表論文3篇,在其他IEEE Transactions及Pattern Recognition等國(guó)際SCI源期刊上發(fā)表論文70余篇。SCI被引用3000余次,Google Scholar被引用6000余次,F(xiàn)擔(dān)任國(guó)際SCI源學(xué)術(shù)期刊的IEEE Trans. on Neural Networks andLearning Systems和Pattern Recognition Letters的編委。
曾獲國(guó)家自然科學(xué)二等獎(jiǎng)(第二完成人);江蘇省科技進(jìn)步一等獎(jiǎng)(第二完成人);教育部科技進(jìn)步獎(jiǎng)(推廣類(lèi)) 二等獎(jiǎng)(第二完成人);第十一屆中國(guó)青年科技獎(jiǎng);第二屆“SCOPUS尋找青年科學(xué)之星”成長(zhǎng)獎(jiǎng);國(guó)務(wù)院政府特殊津貼;入選國(guó)家百千萬(wàn)人才工程,被授予“有突出貢獻(xiàn)中青年專(zhuān)家”稱(chēng)號(hào)。
AnIntroduction of Micro-expression Recognition
王甦菁
中國(guó)科學(xué)院心理研究所 博士后
Abstract: 在這個(gè)報(bào)告中,我將給大家簡(jiǎn)單介紹一下微表情識(shí)別的一些進(jìn)展。微表情指人們?cè)谠噲D隱藏自己的情緒時(shí)所泄露快速表情。這樣微表情就可以一條很重要的測(cè)謊線(xiàn)索用于測(cè)謊,并有可能被廣泛地應(yīng)用于安全、司法臨床等領(lǐng)域。在本報(bào)告中,列舉了當(dāng)前微表情識(shí)別幾個(gè)公開(kāi)的數(shù)據(jù)庫(kù),并介紹了幾種微表情識(shí)別的新的算法,最后了微表情識(shí)別中所面臨的問(wèn)題。
Bio:王甦菁,2012年7月博士畢業(yè)于吉林大學(xué)計(jì)算機(jī)科學(xué)與技術(shù)學(xué)院,同年8月進(jìn)入中國(guó)科學(xué)院心理研究所從事博士后研究,長(zhǎng)期從事特征抽取在人臉識(shí)別上應(yīng)用基礎(chǔ)研究,在流形學(xué)習(xí),張量分析,彩色空間理論等方面有著深入的研究,并與國(guó)內(nèi)外人臉識(shí)別專(zhuān)家有著長(zhǎng)期密切合作。其成果發(fā)表在本領(lǐng)域國(guó)際權(quán)威期刊IEEE Transactions 系列期刊上發(fā)表多篇論文。擔(dān)任國(guó)際學(xué)術(shù)期刊Neurocomputing的編委。承擔(dān)1項(xiàng)國(guó)家自然科學(xué)基金面上項(xiàng)目“基于稀疏張量的微表情識(shí)別研究”項(xiàng)目,發(fā)表30余篇論文。 |
|