Data Cleaning of Duplicate Data Using Levenshtein Distance

REFERENCES

Agarwal, N., Rawat, M. & Maheshwari, V. 2014. Comparative analysis of jaccard
coefficient and cosine similarity for web document similarity measure.
International Journal for Advance Research in Engineering and
Technology. 2: 18-21.
Azma, S. 2006. Pembuatan alat bantu dalam proses data cleaning pada
intragovernmental access to shared information system (IGASIS).
Undergraduate thesis. Universitas Telkom.
Chahal, M. 2016. Information retrieval using jaccard similarity coefficient.
International Journal of Computer Trends and Technology (IJCTT).
36(3): 140-142.
Han, J. & Kamber, M. 2006. Data Mining: Concepts and Techniques. Second
Edition. Elsevier: The United States of America.
He, L., Zhang, Z., Tan, Y. & Liao, M. 2011. An Efficient Data Cleaning
Algorithm Based on Attributes Selection. 6th International Conference on
Computer Science and Convergence Information Technology (ICCIT),
IEEE, pp. 375-379.
Hermawati, F.A. 2013. Data mining. Yogyakarta: Penerbit Andi.
Liliana, Budhi, G.S., Wibisono, A. & Tanojo, R. 2012. Pengecekan plagiarisme
pada code dalam bahasa C++. Jurnal Informatika. Universitas Kristen
Petra Surabaya. (Online)
http://jurnalinformatika.petra.ac.id/index.php/inf/article/view/18649
(18 May 2016).
Prasetyo, E. 2014. Data mining: Mengolah data menjadi informasi menggunakan
matlab. Yogyakarta: Penerbit Andi.
Primadani, Y. 2014. Simulasi algoritma Levenshtein distance untuk fitur
autocomplete pada aplikasi katalog perpustakaan. Undergraduate thesis.
Universitas Sumatera Utara.
Rahm, E. & Do, H.H. 2000. Data Cleaning: Problems and current approaches.
IEEE Bulletin of the Technical Committee on Data Engineering 23(4): 111.
Riezka, A. 2011. Analisis dan implementasi data cleaning menggunakan metode
multi-pass neighborhood (MPN). Undergraduate thesis. Universitas Telkom.
Silberschatz, A., Korth, H.F. & Sudarshan, S. 2006. Database system concepts.
5th Edition. Singapore: McGraw Hill.

Universitas Sumatera Utara

Tamilselvi, J.J. & Saravan, V. 2010. An Evaluation on Current Research Trends
in Data Cleaning on Data Warehousing. International Journal of
Computational Intelligence Research 6(3): 405-430.

Ugon, A., Nicolas, T., Richard, M., Guerin, P., Chansard, P., Demoor, C. &
Toubiana, L. 2015. A new approach for cleansing geographical dataset
using Levenshtein distance, prior knowledge and contextual information.
European Federation for Medical Informatics (EFMI) 227-229.
