Please use this identifier to cite or link to this item: https://hdl.handle.net/10356/48449
Full metadata record
DC FieldValueLanguage
dc.contributor.authorLim, Ivan Wei Jie.
dc.date.accessioned2012-04-24T01:44:47Z
dc.date.available2012-04-24T01:44:47Z
dc.date.copyright2012en_US
dc.date.issued2012
dc.identifier.urihttp://hdl.handle.net/10356/48449
dc.description.abstractCrawling social network data can uncover interesting phenomena for a variety of usage. However it is also generally sluggish due to the fact that it requires 3rd party services over a competitive network. These services are provided at their discretion and their usage quota needs to be complied. Therefore, it resulted in the need to accelerate the retrieval process, which is the objective of this project, to exploit parallelism so as to speed up the crawling procedure. The nature of social network data looks very much like a graph. Hence, the Breadth-First Search (BFS) graph traversal technique is revisited to explore for improvements on crawling operations. This project has chosen Google’s social networking platform called Google+ and experimented parallel crawling method based on BFS to increase throughput. The implementation of the experimental system has performed reasonably well over the naive crawling approach, in light of external limitations like Google’s courtesy usage quota of their services. The system was able fetch more data in the same or even shorter amount of time, therefore, increasing efficiency by a few folds. Although the project demonstrated the speed up of the crawling process, there are still rooms for improvement to further scale up the entire job. Using this as a basis, more concepts can still be used to enhance the efficiency of the system.en_US
dc.format.extent46 p.en_US
dc.language.isoenen_US
dc.rightsNanyang Technological University
dc.subjectDRNTU::Engineering::Computer science and engineering::Computer systems organization::Computer system implementationen_US
dc.subjectDRNTU::Engineering::Computer science and engineering::Computer systems organization::Performance of systemsen_US
dc.titleParallel social network crawler systemen_US
dc.typeFinal Year Project (FYP)en_US
dc.contributor.schoolSchool of Computer Engineeringen_US
dc.description.degreeBachelor of Engineering (Computer Science)en_US
dc.contributor.supervisor2Cheng Sheung Chak Jamesen_US
item.fulltextWith Fulltext-
item.grantfulltextrestricted-
Appears in Collections:SCSE Student Reports (FYP/IA/PA/PI)
Files in This Item:
File Description SizeFormat 
SCE11-0336.pdf
  Restricted Access
Main article1.67 MBAdobe PDFView/Open

Page view(s) 50

549
Updated on Mar 17, 2025

Download(s)

6
Updated on Mar 17, 2025

Google ScholarTM

Check

Items in DR-NTU are protected by copyright, with all rights reserved, unless otherwise indicated.