Change the repository type filter
All
Repositories list
36 repositories
spatial_intuitions
Publiccisnlp.github.io
Public- Tracing Multilingual Factual Knowledge Acquisition in Pretraining
manchu-in-context-mt
PublicMEXA
Public🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual AlignmentGlotCC
Public🕸 GlotCC Dataset and Pipline -- NeurIPS 2024GlotWeb
Public🕸 GlotWeb: Web Indexing for Low-Resource Languages -- under construction.code-specific-neurons
Public💻🔍 How Programming Concepts and Neurons Are Shared in Code Language ModelsGlotLID
Public💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023oscar-io
Publicungoliant
Publicoscar-tools
PublicLangSAMP
PublicLangSAMP: Language-Script Aware Multilingual Pretraininganalogical_reasoning
PublicTransliteration-PPA
PublicBreaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignmentlohoravens-webpage
PublicMaskLID
Public💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024GlotScript
Public🖋 Resource and Tool for Writing System Identification -- LREC 2024Taxi1500
PublicTransMI
PublicTransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated DataTransliCo
PublicTransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language ModelsSpatial_Schemas
PublicXAMPLER
PublicGlot500
PublicGlot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023GlotSparse
PublicGlotSparse: Building Corpora in Under-Resourced LanguagesGlotStoryBook
PublicChildren StoryBooks for 180 langauges.mPLM-Sim
PublicColexificationNet
PublicCrosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs