Chunk size is ignored when EOF is not a new line using ReadableStream #931

rcambrj · 2022-04-20T11:27:55Z

When using a chunk size bigger than the ReadableStream, the last line is ignored in the first chunk and a second chunk is always provided.

given:

title,name
test title 01,test name 01
last line,last line

(without a new line at EOF)

running:

var readStream = fs.createReadStream(__dirname + '/sample.csv', 'utf8');
Papa.parse(readStream, {
	header: false,
	chunkSize: 4000000000,
	chunk: function(item) {
		console.log('>>>> chunk ' + JSON.stringify(item.data));
	},
});

will output:

>>>> chunk [["title","name"],["test title 01","test name 01"]]
>>>> chunk [["last line","last line"]]

This seems to be related to ignoreLastRow which was introduced in #135

Is this expected? Surely in a small enough file, there should be only one chunk.

The text was updated successfully, but these errors were encountered:

DavidCockerill · 2022-05-17T17:02:44Z

I am seeing this also.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chunk size is ignored when EOF is not a new line using ReadableStream #931

Chunk size is ignored when EOF is not a new line using ReadableStream #931

rcambrj commented Apr 20, 2022

DavidCockerill commented May 17, 2022

Chunk size is ignored when EOF is not a new line using ReadableStream #931

Chunk size is ignored when EOF is not a new line using ReadableStream #931

Comments

rcambrj commented Apr 20, 2022

DavidCockerill commented May 17, 2022