Please use the following text to cite this item or export to a predefined format:
Arantza,Otegi; Oier,Imaz; Arantza,Diaz de Ilarraza; Larraitz,Uria and Mikel,Iruskieta, 2017, ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research, Dspace HiTZ Zentroa, https://hdl.handle.net/20.500.14614/8
dc.contributor.author | Arantza,Otegi |
dc.contributor.author | Oier,Imaz |
dc.contributor.author | Arantza,Diaz de Ilarraza |
dc.contributor.author | Larraitz,Uria |
dc.contributor.author | Mikel,Iruskieta |
dc.date.accessioned | 2024-11-20T10:54:57Z |
dc.date.available | 2024-11-20T10:54:57Z |
dc.date.issued | 2017-03-01 |
dc.description.abstract | The reduced size of corpora in some areas of research is due to the lack of tools to process massively and easily the language under study. In this article, we present ANALHITZA, a tool which is being developed within the Clarink project, whose aim is the creation of linguistic technologies that are useful for research on Social Sciences and Humanities. ANALHITZA has been designed to extract linguistic information online from large corpora in an easy way. Besides, it is a multilingual tool which can process texts written in three languages: Basque, Spanish and English. Moreover, we present three real examples of study where ANALHITZA has been used. The tool can be redesigned or changed, according to the needs of the scientific community in the field of Humanities. |
dc.identifier.citation | Otegi, A., Imaz, O., Díaz de Ilarraza Sánchez, A., Iruskieta Quintian, M., & Uria Garin, L. (2017). ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research. SEPLN. |
dc.identifier.uri | https://hdl.handle.net/20.500.14614/8 |
dc.language.iso | Basque |
dc.language.iso | English |
dc.language.iso | Spanish |
dc.rights | Public Domain Mark (PD) |
dc.rights.label | Publi |
dc.rights.uri | http://creativecommons.org/publicdomain/mark/1.0/ |
dc.source.uri | https://www.ixa.eus/node/8827 |
dc.subject | Tool |
dc.subject | language technologies |
dc.subject | corpora |
dc.subject | text analysis |
dc.subject | PoS |
dc.title | ANALHITZA: a tool to extract linguistic information from large corpora in Humanities research |
dc.type | toolService |
local.contact.person | Mikel Iruskieta mikel.iruskieta@ehu.eus HiTZ - Ixa taldea (UPV/EHU) |
local.demo.uri | https://ixa2.si.ehu.eus/clarink/analhitza.php?lang=en |
local.sponsor | ownFunds IT935-16 Eusko Jaurlaritza IXA taldea A motako ikertalde finkatua |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |