Keep content of textual, unparsed files to the dataset #151

nguyenhoan · 2017-07-19T15:05:09Z

Useful for tasks that processing contents as textual documents.
Users might also write simple parser in Boa to parse and extract information.

nguyenhoan · 2017-07-25T20:43:45Z

53daba4

psybers · 2017-08-14T18:07:59Z

Should we store all textual files, including ones we parsed? That gives users a consistent capability of being able to analyze any text file (including source).

psybers · 2017-08-14T18:08:13Z

The data should be stored into its own data file, not in the AST sequence file.

psybers · 2017-08-14T18:08:31Z

There needs to be a Boa function to read the file and return the contents for a given ChangedFile.

nguyenhoan · 2017-08-14T21:01:48Z

I don't see benefit of storing parsed files because they are already in the asts.
Storing them would increase the space significantly.

hridesh · 2017-08-14T21:03:43Z

Agreed. Perhaps limit textual contents to unparsed ASCII files?

psybers · 2017-08-14T21:23:27Z

Parsed files do not retain all information (such as whitespace and comments) however. Hence my thoughts to including them.

nguyenhoan added data generation enhancement labels Jul 19, 2017

nguyenhoan closed this as completed Jul 25, 2017

psybers reopened this Aug 14, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keep content of textual, unparsed files to the dataset #151

Keep content of textual, unparsed files to the dataset #151

nguyenhoan commented Jul 19, 2017

nguyenhoan commented Jul 25, 2017

psybers commented Aug 14, 2017

psybers commented Aug 14, 2017

psybers commented Aug 14, 2017

nguyenhoan commented Aug 14, 2017

hridesh commented Aug 14, 2017 via email

psybers commented Aug 14, 2017

Keep content of textual, unparsed files to the dataset #151

Keep content of textual, unparsed files to the dataset #151

Comments

nguyenhoan commented Jul 19, 2017

nguyenhoan commented Jul 25, 2017

psybers commented Aug 14, 2017

psybers commented Aug 14, 2017

psybers commented Aug 14, 2017

nguyenhoan commented Aug 14, 2017

hridesh commented Aug 14, 2017 via email

psybers commented Aug 14, 2017