Focused Web Crawler Dengan Sistem Terdistribusi

REFERENCES

Achsan, H. T. Y. & Wibowo, W. C., 2013. A Fast Distributed Focused-Web Crawling.
24th DAAAM International Symposium on Intelligent Manufacturing and Automation,
pp. 492-499.
Apache Ignite, 2016. Apache Ignite. [Online]
Available at: https://ignite.apache.org
[Accessed 16 September 2016].
APJII, 2015. Profil Pengguna Internet Indonesia 2014. Jakarta: Asosiasi Penyelenggara
Jasa Internet Indonesia.
Avraam, I. & Anagnostopoulos, I., 2011. A Comparison over Focused Web Crawling
Strategies. 15th Panhellenic Conference On Informatics (PCI), pp. 245-249.
Baeza-Yates, R., Marin, M., Castillo, C. & Rodriguez, A., 2005. Crawling a Country:
Better Strategies than Breadth-First for Web Page Ordering.
Chakrabarti, S., Berg, M. v. d. & Dom, B., 1999. Focused Crawling: A New Approach
to Topic-Specific Web Resource Discovery. Computer Networks, pp. 1623-1640.
Coulouris, G. F., Dollimore, J. & Kindberg, T., 2012. Distributed Systems: Concepts
and Design. 5th ed. Boston: Addison Wesley.
Ikatan Dokter Anak Indonesia, 2015. IDAI - Public Articles. [Online]
Available at: http://www.idai.or.id/artikel
[Accessed 11 July 2016].
Janbandhu, R., Dahiwale, P. & Raghuwanshi, M., 2014. Analysis of Web Crawling
Algorithms. International Journal on Recent and Innovation Trends in Computing and
Communication, pp. 488-492.
Kateglo, 2016. Kateglo ~ Kamus, tesaurus, dan glosarium bahasa Indonesia. [Online]
Available at: http://www.kateglo.com/
[Accessed 1 August 2016].
Kementerian Kesehatan Republik Indonesia, 2013. Kementerian Kesehatan Republik
Indonesia - Kamus. [Online]
Available at: http://www.depkes.go.id/folder/view/full-content/structure-kamus.html
[Accessed 21 July 2016].
Khodra, L. M. & Wibisono, Y., 2005. Clustering Berita Berbahasa Indonesia. Jurnal
FPMIPA UPI dan KK Informatika ITB.
Kohlschütter, C., 2016. Boilerpipe. [Online]
Available at: https://boilerpipe-web.appspot.com/
[Accessed 15 September 2016].

Universitas Sumatera Utara

Kohlschütter, C., Fankhauser, P. & Nejdl, W., 2010. Boilerplate Detection using
Shallow Text Features. The third ACM international conference on Web search and
data mining, pp. 441-450.
Kritikopoulos, A., Sideri, M. & Stroggilos, K., 2004. CrawlWave: A Distributed
Crawler. 3rd Hellenic Conference on Artificial Intelligence.
Loo, B. T., Cooper, O. & Krishnamurthy, S., 2001. Distributed Web Crawling over
DHTs.
McCallum, A. & Nigam, K., 1998. A Comparison of Event Models for Naive Bayes
Text Classification. AAAI/ICML-98 Workshop on Learning for Text Categorization, pp.
41-48.
Nasri, M., Shariati, S. & Sharifi, M., 2008. Availability and Accuracy of Distributed
Web Crawlers: A Model-Based Evaluation. Second UKSIM European Symposium on
Computer Modeling and Simulation, pp. 453-458.
Rajaraman, A. & Ullman, J. D., 2011. Mining of Massive Datasets. United Kingdom:
Cambridge University Press.
Salton, G. & McGill, M. J., 1983. Introduction to Modern Information Retrieval. New York:
McGraw-Hill.
Seeger, M., 2010. Building Blocks of A Scalable Web Crawler. Thesis. Stuttgart Media
University.
Sharma, S. & Gupta, P., 2015. The Anatomy of Web Crawlers. International
Conference on Computing, Communication and Automation (ICCCA2015), pp. 849-853.
Tala, F. Z., 2003. A Study of Stemming Effects on Information Retrieval in Bahasa
Indonesia. Thesis. Universiteit van Amsterdam.
Treselle System, 2014. Boilerpipe – Web Content Extraction without Boilerplates.
[Online]
Available at: http://www.treselle.com/blog/boilerpipe-web-content-extraction-withoutboiler-plates/
[Accessed 6 August 2016].
Triawati, C., 2009. Metode Pembobotan Statistical Concept Based untuk Klastering dan
Kategorisasi Dokumen Berbahasa Indonesia. s.l.:s.n.
Tsai, C. H., Ku, T., Yang, P. Y. & Chen, M. J., 2014. A Distributed Multi-Tasking Job
Scheduling Mechanism for Web Crawlers. International Conference of Soft Computing
and Pattern Recognition, pp. 243-248.
Wang, W. et al., 2010. A Focused Crawler Based on Naive Bayes Classifier. Third
International Symposium on Intelligent Information Technology and Security
Informatics, pp. 517-521.

Weiss, S., Indurkhya, N., Zhang, T. & Damerau, F., 2005. Text Mining: Predictive
Methods for Analyzing Unstructured Information. New York: Springer.
Wikipedia, 2010. Multithreading (computer architecture). [Online]
Available at: https://en.wikipedia.org/wiki/Multithreading_(computer_architecture)
[Accessed 15 September 2016].
Wikipedia, 2016. Web crawler. [Online]
Available at: https://en.wikipedia.org/wiki/Web_crawler
[Accessed 15 September 2016].
Zhou, B., Xiao, B., Lin, Z. & Zhang, C., 2010. A Distributed Vertical Crawler Using
Crawling-Period Based Strategy. 2nd International Conference on Future Computer
and Communication, pp. 306-311.
