Jose Javier Saiz Anton
Biography
José Javier Saiz is a research engineer at the Language Technologies Unit of the Barcelona Supercomputing Center (BSC). His background is in Modern Languages and Translation, and he transitioned into NLP with a Master's in Language Analysis and Processing from the University of the Basque Country (UPV/EHU). He has expertise in data workflow management and is interested in qualitative analysis of textual data.
Research
Brack, Manuel, Malte Ostendorff, Pedro Ortiz Suarez, José Javier Saiz, Iñaki Lacunza Castilla, Jorge Palomar-Giner, Alexander Shvets, et al. “Community OSCAR: A Community Effort for Multilingual Web Data.” In Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024), edited by Jonne Sälevä and Abraham Owodunni, 232–35. Miami, Florida, USA: Association for Computational Linguistics, 2024. https://doi.org/10.18653/v1/2024.mrl-1.19.
Palomar-Giner, Jorge, Jose Javier Saiz, Ferran Espuña, Mario Mina, Severino Da Dalt, Joan Llop, Malte Ostendorff, et al. “A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages.” In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), edited by Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, and Nianwen Xue, 335–49. Torino, Italia: ELRA and ICCL, 2024. https://aclanthology.org/2024.lrec-main.31.
Ruiz-Fernández, Valle, José Saiz, and Aitor Gonzalez-Agirre. “BSC-LANGTECH at FIGNEWS 2024 Shared Task: Exploring Semi-Automatic Bias Annotation Using Frame Analysis.” In Proceedings of The Second Arabic Natural Language Processing Conference, edited by Nizar Habash, Houda Bouamor, Ramy Eskander, Nadi Tomeh, Ibrahim Abu Farha, Ahmed Abdelali, Samia Touileb, et al., 620–29. Bangkok, Thailand: Association for Computational Linguistics, 2024. https://doi.org/10.18653/v1/2024.arabicnlp-1.67.
Altuna, Begoña, Rodrigo Agerri, Lidia Salas-Espejo, José Javier Saiz, Alberto Lavelli, Bernardo Magnini, Manuela Speranza, Roberto Zanoli, and Goutham Karunakaran. “Overview of TESTLINK at IberLEF 2023: Linking Results to Clinical Laboratory Tests and Measurements.” Procesamiento Del Lenguaje Natural 71, no. 0 (2023): 313–20.
Saiz, José Javier, and Begoña Altuna. “End-to-End Temporal Relation Extraction in the Clinical Domain.” In Text2Story@ECIR, 13–23, 2023. https://www.di.ubi.pt/~jpaulo/T2S/paper2.pdf.