A RESEARCH ON RETRIEVING AND PARSING OF MULTIPLE WEB PAGES FOR STORING THEM IN LARGE DATABASES
This paper presents one of the studies we carried out jointly during the research for our Ph.D. theses. Cristian Bucur's thesis aims to study how the knowledge stored in web pages from various sources can be retrieved and classified. Bogdan Tudorica's thesis aims to study ways of managing large quantities of data for various purposes, especially through the use of new technologies such as NoSQL databases. As such, the application described in this paper is a mixed one, combining web page crawling and parsing with data storage in a commonly used NoSQL database.
| Year of publication: | 2012 |
|---|---|
| Authors: | Cristian, BUCUR ; George, TUDORICA Bogdan |
| Published in: | Revista Economica. - Facultatea de Ştiinţe Economice. - Vol. Supplement.2012, 5, p. 23-30 |
| Publisher: | Facultatea de Ştiinţe Economice |
Similar items by person
- A new application for the management of the MongoDB servers
  George, Tudorica Bogdan, (2013)
- A proposed validation method for a benchmarking methodology
  George, Tudorica Bogdan, (2014)
- A New Application for the Management of the MongoDB Servers
  George, Tudorica Bogdan, (2013)