multilingual-nlp

Here are 30 public repositories matching this topic...

embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark

benchmark information-retrieval retrieval text-classification clustering sts semantic-search reranking text-embedding sgpt neural-search sentence-transformers sbert multilingual-nlp bitext-mining mteb

Updated Jun 21, 2025
Python

DmitryRyumin / EMNLP-2023-Papers

Star

EMNLP 2023 Papers: Explore cutting-edge research from EMNLP 2023, the premier conference for advancing empirical methods in natural language processing. Stay updated on the latest in machine learning, deep learning, and natural language processing with code included. ⭐ support NLP!

Updated May 18, 2024
Python

cisnlp / Glot500

Star

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023

multilingual nlp natural-language-processing acl dataset glot xlm multilingual-models xlm-r multilingual-nlp glot500

Updated Apr 20, 2024
Python

shijie-wu / crosslingual-nlp

Star

This repo supports various cross-lingual transfer learning & multilingual NLP models.

natural-language-processing crosslingual-transfer multilingual-nlp

Updated Sep 13, 2023
Python

csebuetnlp / CrossSum

Star

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), July 9-14, 2023.

cross-lingual-summarization cross-lingual-transfer multilingual-nlp

Updated Mar 26, 2024
Python

ceferisbarov / TUMLU

Star

TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages

nlp multilingual-nlp llms

Updated Feb 25, 2025
Python

BatsResearch / LexC-Gen

Star

Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.

multilingual sentiment-analysis topic-modeling synthetic-data synthetic-dataset-generation low-resource-languages lexicon-based multilingual-nlp llm

Updated Oct 3, 2024
Python

cisnlp / MEXA

Star

🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment

multilingual evaluation embeddings evaluation-metrics cross-lingual multilingual-nlp large-language-models decoder-only

Updated Apr 6, 2025
Python

cambridgeltl / prompt4bli

Star

On Bilingual Lexicon Induction with Large Language Models (EMNLP 2023). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

Updated Jan 23, 2025
Python

negar-foroutan / multiLMs-lang-neutral-subnets

Star

[EMNLP 2022] Discovering Language-neutral Sub-networks in Multilingual Language Models.

mt5 lottery-ticket-hypothesis mbert cross-lingual-transfer multilingual-language-models multilingual-nlp

Updated Apr 1, 2024
Python

longxudou / multispider

Star

MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing

multilingual natural-language-processing japanese-language semantic-parsing german-language spanish-language text-to-sql french-language multilingual-nlp

Updated Mar 12, 2024
Python

MaLA-LM / mala-500

Star

MaLA-500: Massive Language Adaptation of Large Language Models

multilingual-nlp large-language-models

Updated Apr 24, 2024
Python

swaggy66 / M-ABSA

Star

M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis

multilingual multilingual-nlp multilingual-absa

Updated May 24, 2025
Python

negar-foroutan / multilingual-code-switched-reasoning

Star

[EMNLP 2023 - Findings] Breaking the Language Barrier: Improving Cross-Lingual Reasoning with Structured Self-Attention

multilingual-language-model multilingual-nlp cross-lingual-reasoning code-switched-reasoning

Updated Dec 7, 2023
Python

cambridgeltl / sail-bli

Star

Self-Augmented In-Context Learning for Unsupervised Word Translation (ACL 2024). Keywords: Bilingual Lexicon Induction, Word Translation, Large Language Models, LLMs.

Updated Aug 12, 2024
Python

deokhk / CBP

Star

Official Repository for Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing (EMNLP 2024)

semantic-parsing multilingual-nlp emnlp2024

Updated Feb 21, 2025
Python

epfl-nlp / ConLID

Star

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification [arXiv - 2025]

language-identification low-resource-languages multilingual-language-models multilingual-nlp

Updated Jun 19, 2025
Python

tchewik / bilingualrsp

Star

The official code and data for the ACL 2024 Findings paper "Bilingual Rhetorical Structure Parsing with Large Parallel Annotations".

discourse-parsing rhetorical-structure-theory multilingual-nlp

Updated Aug 13, 2024
Python

faisaltareque / Multilingual-Sentence-Tokenizer

Star

This Python package is designed for tokenizing sentences in over 40 languages. It serves as a wrapper around various open-source libraries. The package was created to support our work XL-HeadTags. To use it, simply provide the word and its corresponding language to the stemmer, and it will return the stemmed version of the word.

nlp sentence-tokenizer multilingual-nlp

Updated Aug 11, 2024
Python

faisaltareque / Multilingual-Rouge-Scorer

Star

This Python package is used for calculating ROUGE scores and supports over 100 languages by utilizing a multilingual BPE tokenizer. It leverages the mBERT tokenizer and was developed to support our work XL-HeadTags.

nlp summarization rouge-metric multilingual-nlp

Updated Aug 11, 2024
Python

Improve this page

Add a description, image, and links to the multilingual-nlp topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multilingual-nlp topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multilingual-nlp

Here are 30 public repositories matching this topic...

embeddings-benchmark / mteb

DmitryRyumin / EMNLP-2023-Papers

cisnlp / Glot500

shijie-wu / crosslingual-nlp

csebuetnlp / CrossSum

ceferisbarov / TUMLU

BatsResearch / LexC-Gen

cisnlp / MEXA

cambridgeltl / prompt4bli

negar-foroutan / multiLMs-lang-neutral-subnets

longxudou / multispider

MaLA-LM / mala-500

swaggy66 / M-ABSA

negar-foroutan / multilingual-code-switched-reasoning

cambridgeltl / sail-bli

deokhk / CBP

epfl-nlp / ConLID

tchewik / bilingualrsp

faisaltareque / Multilingual-Sentence-Tokenizer

faisaltareque / Multilingual-Rouge-Scorer

Improve this page

Add this topic to your repo