JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model

Data on the web page can be available in various formats, such as table. With the growing of web pages, the need to extract data from tables is increasing. Results of the extraction can be used for integration with other web tables or stored in a database. This study discusses the extraction of data...

Full description

Main Author: Memen Akbar, Cici Patmala, Dini Nurmalasari
Format: Jurnal
Language: Bahasa Indonesia
Published: Teknik elektro UGM 2016
Subjects:
Online Access: http://oaipmh-jogjalib.umy.ac.idkatalog.php?opo=lihatDetilKatalog&id=82792
PINJAM
id oai:lib.umy.ac.id:82792
recordtype oai_dc
spelling oai:lib.umy.ac.id:827922021-06-16T13:11:04ZJURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object ModelMemen Akbar, Cici Patmala, Dini Nurmalasari Data on the web page can be available in various formats, such as table. With the growing of web pages, the need to extract data from tables is increasing. Results of the extraction can be used for integration with other web tables or stored in a database. This study discusses the extraction of data from a table on a web page using a Document Object Model (DOM) tree. The initial step of this extraction process is to transform the HTML document into a DOM tree. Then, by applying search methods Depth First Search (DFS), part of the data in the table is extracted and stored in a CSV file. An engine has been developed using Visual Basic. The results show that the engine can automatically extract data from the table that has the following characteristics: the number of rows and columns are not limited, able to handle all of the table orientation layout, and able to handle tables that are merged cells. Teknik elektro UGM 2016JurnalISBN: ISSN : 2301-4156JNTETI TAHUN VOL NO 2016 5 4 Bahasa Indonesiahttp://oaipmh-jogjalib.umy.ac.idkatalog.php?opo=lihatDetilKatalog&id=82792
institution Universitas Muhammadiyah Yogyakarta
collection Perpustakaan Yogyakarta
language Bahasa Indonesia
topic
spellingShingle
Memen Akbar, Cici Patmala, Dini Nurmalasari
JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
description Data on the web page can be available in various formats, such as table. With the growing of web pages, the need to extract data from tables is increasing. Results of the extraction can be used for integration with other web tables or stored in a database. This study discusses the extraction of data from a table on a web page using a Document Object Model (DOM) tree. The initial step of this extraction process is to transform the HTML document into a DOM tree. Then, by applying search methods Depth First Search (DFS), part of the data in the table is extracted and stored in a CSV file. An engine has been developed using Visual Basic. The results show that the engine can automatically extract data from the table that has the following characteristics: the number of rows and columns are not limited, able to handle all of the table orientation layout, and able to handle tables that are merged cells.
format Jurnal
author Memen Akbar, Cici Patmala, Dini Nurmalasari
author_sort Memen Akbar, Cici Patmala, Dini Nurmalasari
title JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
title_short JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
title_full JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
title_fullStr JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
title_full_unstemmed JURNAL NASIONAL TEKNIK ELEKTRO DAN TEKNOLOGI INFORMASI: Ekstraksi Data pada Tabel dari Halaman Web Menggunakan Pohon Document Object Model
title_sort jurnal nasional teknik elektro dan teknologi informasi: ekstraksi data pada tabel dari halaman web menggunakan pohon document object model
publisher Teknik elektro UGM
publishDate 2016
url http://oaipmh-jogjalib.umy.ac.idkatalog.php?opo=lihatDetilKatalog&id=82792
isbn ISBN: ISSN : 2301-4156
callnumber-raw JNTETI TAHUN VOL NO 2016 5 4
callnumber-search JNTETI TAHUN VOL NO 2016 5 4
_version_ 1702754762836934656
score 14.79448