KAIST Logo

Department of Computer Science
Div. of Web Science and Technology
Korea Advanced Institute of Science and Technology (KAIST)

CS688/WST665: Web-Scale Image Retrieval (Fall 2014)


CS688/WST665: Web-Scale Image Retrieval (Fall 2014)

Instructor: Sung-eui Yoon


When and where: 4:00-5:15pm on Tue. and Thur. at Room 3445 in the CS building
First class: Sep-2 (Important announce for the first class)
Textbook: In-class handouts and ongoing draft on image search
Board: Noah board
Previous Board(Fall 2012): Noah board(2012)
Question Page: Question Submission
Paper Submission Page: Paper Summary Submission


Line

Outline

  • Course overview
  • Lectures and tentative schedule
  • Student presentations
  • Additional reference materials
  • Line

    Course overview

    We extract feature points between two similar images and match them, followed by overlaying them together based on the matched points in the bottom row.

    Thanks to rapid advances of digital camera and various image processing tools, we can easily create new pictures, images, and videos for various purposes. This in turn results in a huge amount of images in the internet and even in personal computers. For example, flickr, an image hosting website, contains more than five billion images and flickr members update more than three thousands image every minute.

    These huge image databases pose numerous technical challenges in terms of image processing, searching, storing, etc. In this class we will discuss various scalable techniques for web-scale image/video databases and novel applications that can utilize such data.

    In summary, what you will get at the end of the course:

  • Broad understanding on image/video retrieval techniques
  • In-depth knowledge on recent methods that can handle web-scale data
  • Study novel applications that utilize web-data
  • What you will do:

  • Choose and present a few papers from a paper list
  • Final project: come up with your own idea related to the topic, (optionally) implement it to improve the state-of-the-art techniques
  • Mid-term exam: reviewing basic image retrieval methods
  • Line

    Lecture schedule (subject to change)

    # of lecture, date Topics and slides Related material(s)
    Sep-2 (T)
    Overview on the course and course policy Important announcement on the first class
    Sep-4 (Th)
    Sep-11 (Th)
    Keypoint localization
    Scale Invariant Region Selection
    Code of Harris detector
    Code of Blob Laplacian
    Sep-16 (T)
    Sep-18 (Th)
    Descriptor
    Intro to Object Recognition
    Programming Assignment1
    Sep-23 (T)
    Sep-25 (Th)
    Bag-of-Words(BoW) Models Programming Assignment2
    PA2 dataset
    Student Presentation Guidance
    Sep-30 (T)
    Oct-2 (Th)
    Recent Image Retrieval Techniques
    Oct-7 (T)
    Invited Talk on Geometric Computing
    Oct-14 (T)
    Oct-16 (Th)
    Hashing Techniques
    Web-Scale Image Databases and Their Applications
    Project Guidelines
    Oct-21 (T)
    Oct-23 (Th)
    Oct-28 (T) ¹ÚÈñ¿õ ¿ÀÀº¼ö À±¿õÁ÷
    Oct-30 (Th) ±èº´ÁÖ ±èÀο± ÀÌÇü¼®
    Nov-4 (T) ±èÅ¿µ ±èµ¿½Å ³ª¿øÁØ ÀÌÀ±¼®
    Nov-6 (Th) ±èÅ¿¬ ¾çÇöö ÀÌ¿¹°­
    Nov-11 (T) ±Ç¿ë¼± ¹ÚÁø¼± ÀÌ°¡¿µ
    Nov-13 (Th) ±è¼öÁö À¯º´ÀÎ ÃÖâ¹Î
    Nov-18 (T) Mid-project presentation:
    1.À¯º´ÀÎ,ÀÌ¿¹°­
    2.ÀÌÇü¼®,±èº´ÁÖ
    3.¾çÇöö,±èÅ¿µ,±Ç¿ë¼±
    4.±èµ¿½Å,±èÅ¿¬
    Nov-20 (Th) Mid-project presentation:
    1.¹ÚÁø¼±,ÀÌ°¡¿µ
    2.ÀÌÀ±¼®,À±¿õÁ÷,±è¼öÁö
    3.ÃÖâ¹Î,±èÀο±
    4.¿ÀÀº¼ö,³ª¿øÁØ,¹ÚÈñ¿õ
    Nov-25 (T) À±¿õÁ÷ ¿ÀÀº¼ö ±Ç¿ë¼±
    Nov-27 (Th) ÀÌÀ±¼® ±è¼öÁö ±èµ¿½Å
    Dec-2 (T) ÀÌ¿¹°­ ÀÌ°¡¿µ ±èº´ÁÖ ÀÌÇü¼®
    Dec-4 (Th) ±èÀο± À¯º´ÀÎ ±èÅ¿µ
    Dec-9 (T) ¹ÚÁø¼± ³ª¿øÁØ ¾çÇöö
    Dec-11 (Th)
    ÃÖâ¹Î ±èÅ¿¬ ¹ÚÈñ¿õ
    Dec-16 (T)
    Final-project presentation:
    1.±èµ¿½Å,±èÅ¿¬
    2.¹ÚÁø¼±,ÀÌ°¡¿µ
    3.À¯º´ÀÎ,ÀÌ¿¹°­
    4.ÀÌÇü¼®,±èº´ÁÖ
    5.ÃÖâ¹Î,±èÀο±
    6.¾çÇöö,±èÅ¿µ,±Ç¿ë¼±
    7.ÀÌÀ±¼®,À±¿õÁ÷,±è¼öÁö
    8.¿ÀÀº¼ö,³ª¿øÁØ,¹ÚÈñ¿õ
    Dec-18 (Th)
    Line

    Student presentations and reports

    For your presentations, please use the this powerpoint template; paper presentation guideline is available.

    For your final report, please use the this latex template

    Line

    Additional reference materials and links

  • WST665/CS770 homepage at fall of 2011
  • WST665/CS688 homepage at fall of 2012

  • Computer vision resources (papers, code, datasets, etc.):

  • CVPapers
  • Multimedia Information Retrieval
  • VLFeat: contains popular computer vision algorithms including SIFT, MSER, k-means, hierarchical k-means, agglomerative information bottleneck, and quick shift
  • Paper search:

  • Google scholar
  • Tim Rowley's graphics paper collections
  • Ke-Sen Huang's graphics paper collections
  • Line

    Acknowledgements: The course materials are based on those of Prof. Fei-Fei Li, Stanford. Thank you so much! Line

    Copyright 2012. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the author.

    This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.