Interface for Querying and Data Mining for the IMDb Dataset

Document Type

Conference Proceeding

Publication Date

April 2016

Journal / Book Title

2016 IEEE Long Island Systems, Applications and Technology Conference (LISAT)

This paper describes the design and implementation of a tool that extracts the IMDb dataset files and imports them into a database. The approach differs from previously published tools and research, which relied on relational databases: this tool uses document-oriented data structures and lets others modify the code to adapt those structures to their needs. The project was built with technologies currently in demand among web developers and software engineers, so other developers can fork the work and use it in their own projects. It also gave the project team an opportunity to develop additional marketable skills. Finally, a web interface was developed to run queries against the imported data and thereby validate the import process. These queries include searching by a person's name, searching by movie or TV title, and viewing detailed data on an individual person or movie/TV title.



Published Citation

Butler, M., & Robila, S. (2016, April). Interface for querying and data mining for the IMDb dataset. In 2016 IEEE Long Island Systems, Applications and Technology Conference (LISAT) (pp. 1-6). IEEE.