1. M. Araneda, F. Bravo-Marquez, D. Parra, and R.F Cádiz MUSIB: Musical Score Inpainting Benchmark. In EURASIP Journal on Audio, Speech, and Music Processing, 2023, 19 (2023). DOI:10.1186/s13636-023-00279-6 (pdf)
  2. G. Iturra-Bocaz and F. Bravo-Marquez RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data Stream. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023), Taipei, Taiwan. Association for Computing Machinery. Pages 3027–3036. DOI:10.1145/3539618.3591908 (pdf).
  3. Matias Rojas, Casimiro Pio Carrino, Aitor Gonzalez-Agirre, Jocelyn Dunstan, and Marta Villegas. 2022. Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives. In Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis (LOUHI), pages 14–25, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics. (pdf).
  4. Divide and Conquer: An Extreme Multi-Label Classification Approach for Coding Diseases and Procedures in Spanish (Barros et al., Louhi 2022) (pdf).
  5. Claudio Aracena, Fabián Villena, Matias Rojas, and Jocelyn Dunstan. 2022. A Knowledge-Graph-Based Intrinsic Test for Benchmarking Medical Concept Embeddings and Pretrained Language Models. In Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis (LOUHI), pages 197–206, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics. (pdf).


  1. Sipiran, I., Mendoza, A., Apaza, A., Lopez, C.: Data-driven Restoration of Digital Archaeological Pottery with Point Cloud Analysis . International Journal of Computer Vision,130(9), pp 2149–2165. Springer. 2022. (pdf)
  2. Romanengo, C., Raffo, A., Biasotti, S., Falcidieno, B., Fotis, V., Romanelis, I., Psatha, E., Moustakas, K., Sipiran, I., Nguyen, Q., Chu, C., Nguyen-Ngoc, K., Vo, D., To, T., Nguyen, N., Le-Pham, N., Nguyen, H., Tran, M., Qie, Y., & Anwer, N.: SHREC 2022: Fitting and recognition of simple geometric primitives on point clouds. Computers & Graphics. Vol 107, October, pp. 32-49. Elsevier. 2022. Publisher site.
  3. Thompson, E.M., Ranieri, A., Biasotti, S., Chicchón, M., Sipiran, I., Pham, M., Nguyen-Ho, T., Nguyen, H., & Tran, M.: SHREC 2022: pothole and crack detection in the road pavement using images and RGB-D data. Computers & Graphics. Vol 107, October, pp. 161-171. Elsevier. 2022 Publisher site.
  4. Matias Rojas, Felipe Bravo-Marquez, and Jocelyn Dunstan. 2022. Simple Yet Powerful: An Overlooked Architecture for Nested Named Entity Recognition. In Proceedings of the 29th International Conference on Computational Linguistics , pages 2108–2117, Gyeongju, Republic of Korea. International Committee on Computational Linguistics. (pdf)
  5. Matias Rojas, Jose Barros, Kinan Martin, Mauricio Araneda-Hernandez, and Jocelyn Dunstan. 2022. PLN CMM at SocialDisNER: Improving Detection of Disease Mentions in Tweets by Using Document-Level Features. In Proceedings of The Seventh Workshop on Social Media Mining for Health Applications, Workshop & Shared Task , pages 52–54, Gyeongju, Republic of Korea. Association for Computational Linguistics. (pdf)
  6. Clinical Flair: A Pre-Trained Language Model for Spanish Clinical Natural Language Processing (Rojas et al., ClinicalNLP 2022) (pdf)
  7. Rojas, M., Barros, J., Araneda, M., & Dunstan, J. (2022). FLERT-Matcher: A Two-Step Approach for Clinical Named Entity Recognition and Normalization. (pdf)
  8. P. Baéz, F. Bravo-Marquez, J. Dunstan, M. Rojas, and F. Villena Automatic Extraction of Nested Entities in Clinical Referrals in Spanish. In ACM Transactions on Computing for Healthcare (HEALTH), Volume 3, Issue 3, July 2022. Pages 1–22. DOI:10.1145/3498324 (pdf)
  9. J. Diaz, F. Bravo-Marquez and B. Poblete, Language Modeling on Location-Based Social Networks. In ISPRS International Journal of Geo-Information , Volume 11, Number 2, Article Number 147, February 2022. DOI: 10.3390/ijgi11020147 (pdf)
  10. F. Bravo-Marquez and C. Tamblay Words, Tweets and Reviews: Leveraging Affective Knowledge Between Multiple Domains. In Cognitive Computation, Volume 14, January 2022. Pages 388-406. DOI: 10.1007/s12559-021-09923-9 (pdf)
  11. F. Bravo-Marquez, A. Khanchandani, and B. Pfahringer Incremental word-vectors for time-evolving sentiment lexicon induction. In Cognitive Computation, Volume 14, January 2022. Pages 425-441. DOI:10.1007/s12559-021-09831-y (pdf)
  12. H. Sarmiento, F. Bravo-Marquez, E. Graells-Garrido, and B. Poblete Identifying and Characterizing New Expressions of Community Framing during Polarization. In Proceedings of the 16th The International AAAI Conference on Web and Social Media (ICWSM 2022), Atlanta, Georgia, USA. AAAI Press. Pages 841-851. (pdf)
  13. F. D. Zamora-Reina, F. Bravo-Marquez, and D. Schlechtweg LSCDiscovery: A shared task on semantic change discovery and detection in Spanish. In Proceedings of the 3rd International Workshop on Computational Approaches to Historical Language Change (LCHANGE 2022), co-located with ACL 2022, Dublin, Ireland. Association for Computational Linguistics. Pages 149–164. (pdf), (codalab)
  14. V. Araujo, A. Caravallo, S. Kundu, J. Cañete, M. Mendoza, R. E. Mercer, F. Bravo-Marquez, M. Moens, and A. Soto Evaluation Benchmarks for Spanish Sentence Representations. In Proceedings of the 13th Edition of The Language Resources and Evaluation Conference (LREC 2022) , Marseille, France. Pages 6024-6034. (pdf)
  15. J. Cañete, S. Donoso, F. Bravo-Marquez, A. Caravallo, and V. Araujo ALBETO and DistilBETO: Lightweight Spanish Language Models. In Proceedings of the 13th Edition of The Language Resources and Evaluation Conference (LREC 2022) , Marseille, France. Pages 4291-4298. (pdf)
  16. Aymé Arango, Jorge Pérez, Barbara Poblete, Hate speech detection is not as easy as you may think: A closer look at model validation (extended version). Inf. Syst. 105: 101584 (2022)
  17. Jesus Perez-Martin, Benjamin Bustos, Silvio Jamil Ferzoli Guimarães, Ivan Sipiran, Jorge Pérez, Grethel Coello Said: A comprehensive review of the video-to-text problem. Artif. Intell. Rev. 55(5): 4165-4239 (2022)


  1. Manuel Alfonseca, Manuel Cebrián, Antonio Fernández Anta, Lorenzo Coviello, Andrés Abeliuk, Iyad Rahwan: Superintelligence Cannot be Contained: Lessons from Computability Theory. J. Artif. Intell. Res. 70: 65-76 (2021)
  2. Hernan Sarmiento, Barbara Poblete: Crisis communication: a comparative study of communication patterns across crisis events in social media. SAC 2021: 1711-1720
  3. F. Tobar, F. Bravo-Marquez, J. Dunstan, J. Fontbona, A. Maass, and D. Remenik, and J.F. Silva Data Science for Engineers: A Teaching Ecosystem. In IEEE Signal Processing Magazine, Volume 38, Issue 3, May 2021. Pages 144-153. DOI:10.1109/MSP.2021.3053551 (pdf)
  4. A. Ansell, F. Bravo-Marquez, and B. Pfahringer PolyLM: Learning about Polysemy through Language Modeling. In Proceedings of the 16th conference of the European Chapter of the Association for Computational Linguistics (EACL 2021), Kyiv, Ukraine. Pages 563–574. (pdf),(code).
  5. J. Cerezo, A. Bergel, and F. Bravo-Marquez Tools Impact on the Quality of Annotations for Chat Untangling. In Proceedings of the 2021 ACL-IJCNLP Student Research Workshop (SRW), Bangkok, Thailand. (pdf)
  6. J. Muñoz and F. Bravo-Marquez Interventions Recommendation: Professionals’ Observations Analysis in Special Needs Education. In Proceedings of the 16th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2021), Co-located with EACL 2021, Kyiv, Ukraine. Pages 171-179 (pdf),(code).
  7. Jocelyn Dunstan, Fabián Villena, Jorge Pérez, René Lagos: Supporting the classification of patients in public hospitals in Chile by designing, deploying and validating a system based on natural language processing. BMC Medical Informatics Decis. Mak. 21(1): 208 (2021)
  8. Aimei Yang, Ian Myoungsu Choi, Andrés Abeliuk, Adam J. Saffer, The Influence of Interdependence in Networked Publics Spheres: How Community-Level Interactions Affect the Evolution of Topics in Online Discourse. J. Comput. Mediat. Commun. 26(3): 148-166 (2021)
  9. Zihao He, Negar Mokhberian, António Câmara, Andrés Abeliuk, Kristina Lerman: Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings. EMNLP (Findings) 2021: 2102-2118
  10. Nathan Bartley, Andrés Abeliuk, Emilio Ferrara, Kristina Lerman: Auditing Algorithmic Bias on Twitter. WebSci 2021: 65-73


  1. Barbara Poblete, Jorge Pérez: Minding the AI gap in LATAM. Commun. ACM 63(11): 61-63 (2020)
  2. Jose Miguel Herrera, Denis Parra, Barbara Poblete: Social QA in non-CQA platforms. Future Gener. Comput. Syst. 105: 631-649 (2020)
  3. Henry Rosales-Méndez, Aidan Hogan, Barbara Poblete: Fine-Grained Entity Linking. J. Web Semant. 65: 100600 (2020)
  4. Javier Carrasco, Aidan Hogan, Jorge Pérez: Laconic Image Classification: Human vs. Machine Performance. CIKM 2020: 115-124
  5. Jorge Pérez, Francisco Plana: Food sharing gave birth to social networks. CogSci 2020
  6. P. Báez, F. Villena, M. Rojas, M. Durán, and J. Dunstan The Chilean Waiting List Corpus: a new resource for clinical Named Entity Recognition in Spanish, In Proceedings of the 3rd Clinical Natural Language Processing Workshop, November, 291-300, 2020. DOI:10.18653/v1/2020.clinicalnlp-1.32 (pdf)
  7. Model Interpretability through the Lens of Computational Complexity Pablo Barceló, Mikael Monet, Jorge Pérez, Bernardo Subercaseaux NeurIPS 2020
  8. J. Diaz, B. Poblete, and F. Bravo-Marquez An Integrated Model for Textual Social Media Data with Spatio-Temporal Dimensions, In Information Processing & Management, Volume 57, Issue 5, 2020. DOI:10.1016/j.ipm.2020.102219 (pdf)
  9. D.G. Trye, A.S. Calude, F. Bravo-Marquez, and T.T. Keegan Hybrid Hashtags: #YouKnowYoureAKiwiWhen your Tweet contains Māori and English, In Frontiers in Artificial Intelligence, section Language and Computation Volume 3, Article 15, April 2020. DOI: 10.3389/frai.2020.00015. (pdf|supplementary Material)
  10. P. Badilla, F. Bravo-Marquez, and J. Pérez WEFE: The Word Embeddings Fairness Evaluation Framework In Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI 2020), Yokohama, Japan. Pages 430-436. DOI:10.24963/ijcai.2020/60. Acceptance rate: 12.6%. (pdf),(code).
  11. The Logical Expressiveness of Graph Neural Networks Pablo Barceló, Egor V. Kostylev, Mikael Monet, Jorge Pérez, Juan Reutter and Juan-Pablo Silva, ICLR 2020 (talk, slides, poster)
  12. Spanish Pre-Trained BERT Model and Evaluation Data Jose Cañete, Gabriel Chaperon, Rodrigo Fuentes, Jou-Hui Ho, Hojin Kang and Jorge Pérez PML4DC @ ICLR 2020 (talk, slides, code)
  13. Predicting Unplanned Readmissions with Highly Unstructured Data Constanza Fierro, Jorge Pérez, and Javier Mora, AI4AH @ ICLR 2020.
  14. Jesus Perez-Martin, Benjamin Bustos, Jorge Pérez: Attentive Visual Semantic Specialized Network for Video Captioning. ICPR 2020: 5767-5774


  1. Jorge Pérez, Javier Marinković and Pablo Barceló, On the Turing Completeness of Modern Neural Network Architectures, ICLR 2019. (pdf) (poster)
  2. Aymé Arango, Jorge Pérez, Barbara Poblete , Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation, SIGIR 2019. (pdf)
  3. Pablo Barceló, Nelson Higuera, Jorge Pérez and Bernardo Subercaseaux, Expressiveness of Matrix and Tensor Query Languages in terms of ML Operators, DEEM @ SIGMOD 2019. (pdf) (slides)
  4. F. Bravo-Marquez, E. Frank, B. Pfahringer, and S. M. Mohammad AffectiveTweets: a WEKA Package for Analyzing Affect in Tweets, In Journal of Machine Learning Research 20(92): Pages 1−6, 2019. (pdf)
  5. S. Lang, F. Bravo-Marquez, C. Beckham, M. Hall, and E. Frank WekaDeeplearning4j: a Deep Learning Package for Weka based on DeepLearning4j, In Knowledge-Based Systems, Volume 178, 15 August 2019, Pages 48-50. DOI: 10.1016/j.knosys.2019.04.013 (pdf)
  6. A. Ansell, F. Bravo-Marquez, and B. Pfahringer An ELMo-inspired approach to SemDeep-5's Word-in-Context task. In Proceedings of the 5th Workshop on Semantic Deep Learning (SemDeep-5) co-located with IJCAI 2019 in Macau, China. (pdf)
  7. D. Trye, A. S. Calude, F. Bravo-Marquez, and T. T Keegan Māori Loanwords: A Corpus of New Zealand English Tweets. In Proceedings of the 2019 ACL Student Research Workshop (SRW), Florence, Italy. (pdf)
  8. F. Villena and J. Dunstan Obtención automática de palabras clave en textos clínicos: una aplicación de procesamiento del lenguaje natural a datos masivos de sospecha diagnóstica en Chile. In Revista médica de Chile, Volume 147, 2019. DOI:http://dx.doi.org/10.4067/s0034-98872019001001229 (pdf)
  9. Marcelo Mendoza0000-0002-7969-6041, Barbara Poblete, Ignacio Valderrama: Nowcasting earthquake damages with Twitter. EPJ Data Sci. 8(1): 3:1-3:23 (2019)
  10. Henry Rosales-Méndez, Aidan Hogan, Barbara Poblete: Fine-Grained Evaluation for Entity Linking. EMNLP/IJCNLP (1) 2019: 718-727
  11. Marcelo Mendoza0000-0002-7969-6041, Bárbara Poblete, Ignacio Valderrama: Estimating Ground Shaking Regions with Social Media Propagation Trees. HCI (13) 2019: 356-369
  12. Mauricio Quezada, Barbara Poblete: A Lightweight Representation of News Events on Social Media. SIGIR 2019: 1049-1052
  13. Juglar Diaz, Barbara Poblete: Car Theft Reports: a Temporal Analysis from a Social Media Perspective. WWW (Companion Volume) 2019: 779-782
  14. Karen Oróstica, Barbara Poblete: Mining the Relationship BetweenCar Theft and Places of Social Interest in Santiago Chile. WWW (Companion Volume) 2019: 811-814
  15. Henry Rosales-Méndez, Aidan Hogan, Barbara Poblete: NIFify: Towards Better Quality Entity Linking Datasets. WWW (Companion Volume) 2019: 815-818