Skip to content

Nexdata-AI/480000-corrected-texts-in-German-Spanish-French-Italian

Repository files navigation

480000-corrected-texts-in-German-Spanish-French-Italian

Description

This dataset focuses on the four major European languages (French, German, Spanish, Italian) and contains 480000 pairs of original and corrected text pairs. Each piece of data is presented in JSON format, including two fields: input (raw text) and output (corrected text), which can assist in natural language processing, machine translation, and language teaching research.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1515?source=Github

Specifications

Data content

Text pairs of original and corrected texts for four European languages

Data volume

480000 pairs

Languages

French, German, Spanish, Italian

Field

input,output

Format

JSON

Licensing Information

Commercial License

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published