The patents retrieval prototype in the MOLTO project
Document typeConference report
PublisherACM Press. Association for Computing Machinery
Rights accessRestricted access - publisher's policy
This paper describes the patents retrieval prototype developed within the MOLTO project. The prototype aims to provide a multilingual natural language interface for querying the content of patent documents. The developed system is focused on the biomedical and pharmaceutical domain and includes the translation of the patent claims and abstracts into English, French and German. Aiming at the best retrieval results of the patent information and text content, patent documents are preprocessed and semantically annotated. Then, the annotations are stored and indexed in an OWLIM semantic repository, which contains a patent speci c ontology and others from di erent domains. The prototype, accessible online at http://molto-patents. ontotext.com, presents a multilingual natural language interface to query the retrieval system. In MOLTO, the multilingualism of the queries is addressed by means of the GF Tool, which provides an easy way to build and maintain controlled language grammars for interlingual translation in limited domains. The abstract representation obtained from the GF is used to retrieve both the matched RDF instances and the list of patents semantically related to the user's search criteria. The online interface allows to browse the retrieved patents and shows on the text the semantic annotations that explain the reason why any particular patent has matched the user's criteria.
CitationChechev, M. [et al.]. The patents retrieval prototype in the MOLTO project. A: International World Wide Web Conference. "WWW'12: proceedings of the 21st International conference companion on World Wide Web". Lyon: ACM Press. Association for Computing Machinery, 2012, p. 231-234.