Skip to main content

CSV Parsing and Printing

One common data encoding is the humble CSV (comma separated values) file. With East you can read and write CSV formatted blobs.

CSV decoding

Suppose you had the following data inside the variable blob:

id,name
1,Alice
2,Bob

Imagine you would like to decode that a tabular data form. CSV files contain a collection of rows, but do not necessarily have a "primary" key. East reads CSV into an array of structs. The above CSV could be decoded to the East value:

[
(id = 1, name = "Alice"),
(id = 2, name = "Bob"),
]

The CsvDecode function accepts a blob and returns an array of structs representing the tabular data.

East functionDescriptionExample usageResult
CsvDecodeRead tabular data from a CSV fileCsvDecode(blob, { id: IntegerType, name: StringType } )[(id = 1, name = "Alice"), (id = 2, name = "Bob")]

The second argument specifies the column names and types. The order has to match that in the CSV file header.

An optional third argument allows you to define parsers for each column. Each parser accepts a string (directly extracted from the CSV) and returns a value with the East type nominated for that column. This is particularly useful for parsing custom date and time formats.

An optional fourth argument allows for CSV parsing options, such as the delimiters or null sentinel value. Here is an example:

CsvDecode(
blob,
{
id: IntegerType,
name: StringType
date: DateTimeType
}
{
// put any custom parsers here,
date: str => Parse(DateTime, str, 'MM-DD-YYYY'),
},
{
// these are the default values
nullString: '',
quoteChar: '"',
escapeChar: '"',
delimiter: [','],
newline: ['\r\n', '\n'], // accept either
header: true,
}
)

CSV encoding

You can perform the reverse operation with CsvEncode, which takes an array of structs and returns a blob.

East functionDescriptionExample usageResult
CsvEncodeWrite tabular data to a CSV fileCsvEncode(array_of_structs)0x69642c6e616d650a312c416c6963650a322c426f62

Here the column names and types are known from the East type of the input.

You can specify custom printers for each column as an optional second argument. Each printer accepts a value of that column type and returns a string. This is often used for custom date and time formats.

An optional fourth argument allows for CSV parsing options, such as the delimiters or null sentinel value. Here is an example:

CsvDecode(
blob,
{
id: IntegerType,
name: StringType
}
{
// put any custom printers here,
date: datetime => Print(datetime, 'MM-DD-YYYY'),
},
{
// these are the default values
nullString: '',
quoteChar: '"',
escapeChar: '"',
delimiter: ',',
newline: '\r\n',
header: true,
}
)

Next steps

Continue to the next section to understand some more advanced topics for East and Elara.