Document Server@UHasselt >
Research publications >
Please use this identifier to cite or link to this item:
|Title: ||Information extraction from Web documents based on local unranked tree automaton inference|
|Authors: ||Kosala, Raymond|
VAN DEN BUSSCHE, Jan
|Issue Date: ||2003|
|Publisher: ||Kaufman, Morgan|
|Citation: ||Gottlob, G. & Walch, T. (Ed.) Proceedings of the 18th International Joint Conference on Artificial Intelligence. p. 403-408.|
|Abstract: ||Information extraction (IE) aims at extracting specific
information from a collection of documents.
A lot of previous work on IE from semi-structured
documents (in XML or HTML) uses learning techniques
based on strings. Some recent work converts the document to a ranked tree and uses tree automaton induction. This paper introduces an algorithm that uses unranked trees to induce an automaton.
Experiments show that this gives the best results obtained so far for IE from semi-structured documents based on learning.|
|Type: ||Proceedings Paper|
|Appears in Collections: ||Research publications|
Files in This Item:
|N/A||197.45 kB||Adobe PDF|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.