# An experimental study of $\gamma \gamma \rightarrow hadrons$ at LEP

Abstract : In this paper, we treat the problems of Part-of-Speech (PoS) tagging of unannotated corpora of specialty. The existing taggers are trained on non-specialized corpora, and most often give inconsistent results on specialized texts. In order to learn rules adapted to a specialized field, the usual approach labels manually a large corpus of this field. This is extremely time-consuming. We propose here a semi-automatic approach for PoS tagging corpora of specialty. ETIQ, the new tagger we are building, make it possible to correct the base of rules obtained by Brill‘s tagger and to adapt it to a corpus of specialty. The expert of the field visualizes a basic tagging and corrects it by the insertion of specialized contextual lexical rules. The inserted rules are more expressive than Brill‘s rules. To help the user in this task, we designed an inductive algorithm biased by the "correct" knowledge acquired beforehand by the user. By using machine learning techniques while allowing the expert to incorporate knowledge of the field in an interactive and convivial way, we improve the tagging of a specialty corpus. Our approach has been applied to a molecular biology corpus.
Keywords :
Document type :
Journal articles

Cited literature [14 references]

http://hal.in2p3.fr/in2p3-00004532
Submitted on : Friday, March 31, 2000 - 2:36:10 PM
Last modification on : Tuesday, April 20, 2021 - 12:00:03 PM
Long-term archiving on: : Friday, May 29, 2015 - 4:42:00 PM

### Identifiers

• HAL Id : in2p3-00004532, version 1

### Citation

D. Buskulic, I. de Bonis, D. Decamp, P. Ghez, C. Goy, et al.. An experimental study of $\gamma \gamma \rightarrow hadrons$ at LEP. Physics Letters B, Elsevier, 1993, 313, pp.509-519. ⟨in2p3-00004532⟩

Record views