Interface for Querying and Data Mining for the IMDb Dataset
Document Type
Conference Proceeding
Publication Date
6-16-2016
Journal / Book Title
IEEE
Abstract
This paper describes the design and implementation of a tool to extract the IMDb dataset files and import them into a database. This approach differs from other published tools or research in that the previous work used relational databases. This tool uses document oriented data structures, and allows others to augment the code to change structures based on their needs. The project development required the use of technologies currently in demand for web developers and software engineers, which allows other developers to fork a copy of the work and utilize in their own work. In addition, it provided the project team an opportunity to develop additional marketable skills. Finally, a web interface to perform queries against the import data to validate the import process was also developed. These queries include searching by people's names, searching by movie/tv titles, or viewing specific data on an individual person or movie/tv title‥
DOI
10.1109/LISAT.2016.7494103
Montclair State University Digital Commons Citation
Butler, Martin and Robila, Stefan, "Interface for Querying and Data Mining for the IMDb Dataset" (2016). Department of Computer Science Faculty Scholarship and Creative Works. 350.
https://digitalcommons.montclair.edu/compusci-facpubs/350
Published Citation
Butler, M., & Robila, S. (2016, April). Interface for querying and data mining for the IMDb dataset. In 2016 IEEE Long Island Systems, Applications and Technology Conference (LISAT) (pp. 1-6). IEEE.