Distributed high-dimensional index creation using Hadoop, HDFS and C++

Gylfi Pór Gudmundsson*, Laurent Amsaleg, Björn Pór Jónsson

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

This paper describes an initial study where the open-source Hadoop parallel and distributed run-time environment is used to speedup the construction phase of a large high-dimensional index. This paper first discusses the typical practical problems developers may run into when porting their code to Hadoop. It then presents early experimental results showing that the performance gains are substantial when indexing large data sets.

Original languageEnglish
Title of host publication2012 10th International Workshop on Content-Based Multimedia Indexing, CBMI 2012
Pages83-88
Number of pages6
DOIs
Publication statusPublished - 2012
Event2012 10th International Workshop on Content-Based Multimedia Indexing, CBMI 2012 - Annecy, Haute-Savoie, France
Duration: 27 Jun 201229 Jun 2012

Publication series

NameProceedings - International Workshop on Content-Based Multimedia Indexing
ISSN (Print)1949-3991

Conference

Conference2012 10th International Workshop on Content-Based Multimedia Indexing, CBMI 2012
Country/TerritoryFrance
CityAnnecy, Haute-Savoie
Period27/06/1229/06/12

Fingerprint

Dive into the research topics of 'Distributed high-dimensional index creation using Hadoop, HDFS and C++'. Together they form a unique fingerprint.

Cite this