From 75151fc56c42f2fd2176cc0677159a169c0c22a0 Mon Sep 17 00:00:00 2001
From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com>
Date: Fri, 20 Sep 2024 21:45:28 -0700
Subject: [PATCH 1/4] Update research
---
research/index.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/research/index.md b/research/index.md
index 065ab571..1a713ce5 100644
--- a/research/index.md
+++ b/research/index.md
@@ -46,7 +46,7 @@ profiles for more updated information.
*At AI2, I'm working on various aspects of LM adaptation such as preference data collection and evaluation. I also expanded my work in the multilingual NLP front (SEACrowd, SIGTYP).*
- [SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages](https://arxiv.org/abs/2406.10118)
-
*ArXiV preprint '24*
+
*ArXiV preprint '24, EMNLP (Long Paper) '24*
Holy Lovenia\*, Rahmad Mahendra\*, Salsabil Maulana Akbar\*, Lester James Miranda\*, and 50+ other authors *(∗: major contributor)*.
[[Catalogue](https://seacrowd.github.io/seacrowd-catalogue)] [[Code](https://github.com/SEACrowd/seacrowd-datahub)]
From fb439fa741e2a09cf98d65c54af220276616a084 Mon Sep 17 00:00:00 2001
From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com>
Date: Thu, 26 Sep 2024 16:28:00 -0700
Subject: [PATCH 2/4] Update index.md
---
research/index.md | 4 ++--
1 file changed, 2 insertions(+), 2 deletions(-)
diff --git a/research/index.md b/research/index.md
index 1a713ce5..65fee56c 100644
--- a/research/index.md
+++ b/research/index.md
@@ -46,12 +46,12 @@ profiles for more updated information.
*At AI2, I'm working on various aspects of LM adaptation such as preference data collection and evaluation. I also expanded my work in the multilingual NLP front (SEACrowd, SIGTYP).*
- [SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages](https://arxiv.org/abs/2406.10118)
-
*ArXiV preprint '24, EMNLP (Long Paper) '24*
+
*EMNLP '24, ArXiV preprint '24*
Holy Lovenia\*, Rahmad Mahendra\*, Salsabil Maulana Akbar\*, Lester James Miranda\*, and 50+ other authors *(∗: major contributor)*.
[[Catalogue](https://seacrowd.github.io/seacrowd-catalogue)] [[Code](https://github.com/SEACrowd/seacrowd-datahub)]
- [Consent in Crisis: The Rapid Decline of the AI Data Commons](https://arxiv.org/abs/2407.14933)
-
*ArXiV preprint '24*
+
*NeurIPS D&B '24, ArXiV preprint '24*
Data Provenance Initiative Team (40+ authors). I contributed in the annotation process design for Web Domain services and annotation quality review.
[[Website](https://www.dataprovenance.org/)] [[Collection](https://github.com/Data-Provenance-Initiative/Data-Provenance-Collection)] [[New York Times Feature](https://www.nytimes.com/2024/07/19/technology/ai-data-restrictions.html)]
From 61d107881ce592d080649357731d9fcc9e099dd5 Mon Sep 17 00:00:00 2001
From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com>
Date: Mon, 7 Oct 2024 13:09:09 -0700
Subject: [PATCH 3/4] Update index.md
---
research/index.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/research/index.md b/research/index.md
index 65fee56c..0d94487a 100644
--- a/research/index.md
+++ b/research/index.md
@@ -66,7 +66,7 @@ profiles for more updated information.
### 2023
-*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on core NLP tasks: POS tagging, NER, dependency parsing, etc.*
+*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing*
- [calamanCy: a Tagalog Natural Language Processing Toolkit](https://aclanthology.org/2023.nlposs-1.1/)
*NLP Open-Source Software (NLP-OSS) Workshop @ EMNLP '23*
From cc7525b79a83377970bfa06642b0a65fcc8f1525 Mon Sep 17 00:00:00 2001
From: Lj Miranda <12949683+ljvmiranda921@users.noreply.github.com>
Date: Mon, 7 Oct 2024 13:09:28 -0700
Subject: [PATCH 4/4] Update index.md
---
research/index.md | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/research/index.md b/research/index.md
index 0d94487a..7aee8a96 100644
--- a/research/index.md
+++ b/research/index.md
@@ -66,7 +66,7 @@ profiles for more updated information.
### 2023
-*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing*
+*I spent the early parts of 2023 working on low-resource languages and multilinguality, especially Tagalog, my native language. I mostly focused on linguistic tasks such as POS tagging, NER, and dependency parsing.*
- [calamanCy: a Tagalog Natural Language Processing Toolkit](https://aclanthology.org/2023.nlposs-1.1/)
*NLP Open-Source Software (NLP-OSS) Workshop @ EMNLP '23*