This dataset focuses on the four major European languages (French, German, Spanish, Italian) and contains 480000 pairs of original and corrected text pairs. Each piece of data is presented in JSON format, including two fields: input (raw text) and output (corrected text), which can assist in natural language processing, machine translation, and language teaching research.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1515?source=Github
Text pairs of original and corrected texts for four European languages
480000 pairs
French, German, Spanish, Italian
input,output
JSON
Commercial License