bug: schema detection on CSV, empty entries are assumed to be strings #131

ramfox · 2018-06-06T17:40:55Z

EG:

Name,Home_Runs,Rank
Barry Bonds,762,1
Hank Aaron,755,2
Babe Ruth,714,
Alex Rodriguez,696,4
Willie Mays,660,5

has schema:

"schema": {
  "items": {
    "items": [
      {
        "title": "name",
        "type": "string"
      },
      {
        "title": "home_runs",
        "type": "integer"
      },
      {
        "title": "rank",
        "type": "string"
      }
    ],
    "type": "array"
  },
  "type": "array"
}

Instead of

{
   "title": "rank",
  "type": "string"
}

The text was updated successfully, but these errors were encountered:

ramfox · 2018-06-06T20:38:08Z

Propose that in ParseType checks for empty byte slice. If empty we say type is TypeEmpty.

If a row has multiple types, but one of those are TypeEmpty, we disregard those when we determine the schema.

Except then what value do we give this field when we marshal into go????

b5 added the bug label Jun 11, 2018

b5 added the ready label Feb 18, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug: schema detection on CSV, empty entries are assumed to be strings #131

bug: schema detection on CSV, empty entries are assumed to be strings #131

ramfox commented Jun 6, 2018

ramfox commented Jun 6, 2018 •

edited

Loading

bug: schema detection on CSV, empty entries are assumed to be strings #131

bug: schema detection on CSV, empty entries are assumed to be strings #131

Comments

ramfox commented Jun 6, 2018

ramfox commented Jun 6, 2018 • edited Loading

ramfox commented Jun 6, 2018 •

edited

Loading