From 75151fc56c42f2fd2176cc0677159a169c0c22a0 Mon Sep 17 00:00:00 2001 From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com> Date: Fri, 20 Sep 2024 21:45:28 -0700 Subject: [PATCH 1/4] Update research --- research/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/research/index.md b/research/index.md index 065ab571..1a713ce5 100644 --- a/research/index.md +++ b/research/index.md @@ -46,7 +46,7 @@ profiles for more updated information. *At AI2, I'm working on various aspects of LM adaptation such as preference data collection and evaluation. I also expanded my work in the multilingual NLP front (SEACrowd, SIGTYP).* - [SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages](https://arxiv.org/abs/2406.10118) -
*ArXiV preprint '24* +
*ArXiV preprint '24, EMNLP (Long Paper) '24*
Holy Lovenia\*, Rahmad Mahendra\*, Salsabil Maulana Akbar\*, Lester James Miranda\*, and 50+ other authors *(∗: major contributor)*.
[[Catalogue](https://seacrowd.github.io/seacrowd-catalogue)] [[Code](https://github.com/SEACrowd/seacrowd-datahub)] From fb439fa741e2a09cf98d65c54af220276616a084 Mon Sep 17 00:00:00 2001 From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com> Date: Thu, 26 Sep 2024 16:28:00 -0700 Subject: [PATCH 2/4] Update index.md --- research/index.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/research/index.md b/research/index.md index 1a713ce5..65fee56c 100644 --- a/research/index.md +++ b/research/index.md @@ -46,12 +46,12 @@ profiles for more updated information. *At AI2, I'm working on various aspects of LM adaptation such as preference data collection and evaluation. I also expanded my work in the multilingual NLP front (SEACrowd, SIGTYP).* - [SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages](https://arxiv.org/abs/2406.10118) -
*ArXiV preprint '24, EMNLP (Long Paper) '24* +
*EMNLP '24, ArXiV preprint '24*
Holy Lovenia\*, Rahmad Mahendra\*, Salsabil Maulana Akbar\*, Lester James Miranda\*, and 50+ other authors *(∗: major contributor)*.
[[Catalogue](https://seacrowd.github.io/seacrowd-catalogue)] [[Code](https://github.com/SEACrowd/seacrowd-datahub)] - [Consent in Crisis: The Rapid Decline of the AI Data Commons](https://arxiv.org/abs/2407.14933) -
*ArXiV preprint '24* +
*NeurIPS D&B '24, ArXiV preprint '24*
Data Provenance Initiative Team (40+ authors). I contributed in the annotation process design for Web Domain services and annotation quality review.
[[Website](https://www.dataprovenance.org/)] [[Collection](https://github.com/Data-Provenance-Initiative/Data-Provenance-Collection)] [[New York Times Feature](https://www.nytimes.com/2024/07/19/technology/ai-data-restrictions.html)] From 61d107881ce592d080649357731d9fcc9e099dd5 Mon Sep 17 00:00:00 2001 From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com> Date: Mon, 7 Oct 2024 13:09:09 -0700 Subject: [PATCH 3/4] Update index.md --- research/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/research/index.md b/research/index.md index 65fee56c..0d94487a 100644 --- a/research/index.md +++ b/research/index.md @@ -66,7 +66,7 @@ profiles for more updated information. ### 2023 -*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on core NLP tasks: POS tagging, NER, dependency parsing, etc.* +*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing* - [calamanCy: a Tagalog Natural Language Processing Toolkit](https://aclanthology.org/2023.nlposs-1.1/)
*NLP Open-Source Software (NLP-OSS) Workshop @ EMNLP '23* From cc7525b79a83377970bfa06642b0a65fcc8f1525 Mon Sep 17 00:00:00 2001 From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com> Date: Mon, 7 Oct 2024 13:09:28 -0700 Subject: [PATCH 4/4] Update index.md --- research/index.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/research/index.md b/research/index.md index 0d94487a..7aee8a96 100644 --- a/research/index.md +++ b/research/index.md @@ -66,7 +66,7 @@ profiles for more updated information. ### 2023 -*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing* +*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing.* - [calamanCy: a Tagalog Natural Language Processing Toolkit](https://aclanthology.org/2023.nlposs-1.1/)
*NLP Open-Source Software (NLP-OSS) Workshop @ EMNLP '23*