Re: "The rows are not all the same number of columns."

From: Sebastian Karcher <karcher_at_nyob>
Date: Wed, 18 Jan 2023 07:51:19 -0500
To: CODE4LIB_at_LISTS.CLIR.ORG
Hi Max & list,

We get a version of this error a lot when reading Excel or CSV files into
the Dataverse data repository (which has a library that parses tabular data
into .tsv/.tab files) and in 9 out of 10 cases the problem is a multi-line
cell. Those are valid CSV as per RFC4180 (with the line break in double
quotes) and work with tools like Excel/LO, but many parsing libraries can't
handle them and if the linebreak creates a new row, you get a misalignment.

Just guessing, obviously, but worth looking out for.

Sebastian



-- 
Sebastian Karcher, PhD
www.sebastiankarcher.com
Received on Wed Jan 18 2023 - 07:20:17 EST