Commit
·
7dc4bf9
1
Parent(s):
5dddb0d
Add arXiv link.
Browse files
README.md
CHANGED
|
@@ -170,6 +170,8 @@ license: cc-by-4.0
|
|
| 170 |
|
| 171 |
### `espnet/geolid_combined_shared_trainable`
|
| 172 |
|
|
|
|
|
|
|
| 173 |
This geolocation-aware language identification (LID) model is developed using the [ESPnet](https://github.com/espnet/espnet/) toolkit. It integrates the powerful pretrained [MMS-1B](https://huggingface.co/facebook/mms-1b) as the encoder and employs [ECAPA-TDNN](https://arxiv.org/pdf/2005.07143) as the embedding extractor to achieve robust spoken language identification.
|
| 174 |
|
| 175 |
The main innovations of this model are:
|
|
@@ -177,7 +179,7 @@ The main innovations of this model are:
|
|
| 177 |
2. Conditioning the intermediate representations of the self-supervised learning (SSL) encoder on intermediate-layer information.
|
| 178 |
This geolocation-aware strategy greatly improves robustness, especially for dialects and accented variations.
|
| 179 |
|
| 180 |
-
For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* (arXiv
|
| 181 |
|
| 182 |
### Usage Guide: How to use in ESPnet2
|
| 183 |
|
|
|
|
| 170 |
|
| 171 |
### `espnet/geolid_combined_shared_trainable`
|
| 172 |
|
| 173 |
+
[Paper](https://arxiv.org/pdf/2508.17148)
|
| 174 |
+
|
| 175 |
This geolocation-aware language identification (LID) model is developed using the [ESPnet](https://github.com/espnet/espnet/) toolkit. It integrates the powerful pretrained [MMS-1B](https://huggingface.co/facebook/mms-1b) as the encoder and employs [ECAPA-TDNN](https://arxiv.org/pdf/2005.07143) as the embedding extractor to achieve robust spoken language identification.
|
| 176 |
|
| 177 |
The main innovations of this model are:
|
|
|
|
| 179 |
2. Conditioning the intermediate representations of the self-supervised learning (SSL) encoder on intermediate-layer information.
|
| 180 |
This geolocation-aware strategy greatly improves robustness, especially for dialects and accented variations.
|
| 181 |
|
| 182 |
+
For further details on the geolocation-aware LID methodology, please refer to our paper: *Geolocation-Aware Robust Spoken Language Identification* ([arXiv](https://arxiv.org/pdf/2508.17148)).
|
| 183 |
|
| 184 |
### Usage Guide: How to use in ESPnet2
|
| 185 |
|