-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model Wizard #4
Comments
Created branch. |
…when used in conjunction with Boolean data types in the Request XPath definitions. See #4 for specification.
Completed task to refactor JSON parsing to support JSON Path expressions. Committed to branch. A working example of a JSON Path expression is: $..[?(@.Brand == 'ZTE')] Note the double stop \ point to signify that the JSON Path is to be tested against a singular object, and not an array object. This JSON Path does not work on all parsers but seems fine in JSON.net. |
…gration which inserts a new Permission Specification and allocates to Administrator in Role Registry Permission. Created a page called EntityAnalysisModelWizard and included the validation of the new Permission Specification in code behind. Included the page location in the shared layout menu also on validation of the new Permission Specification. See #4 for specification.
…when used in conjunction with Boolean data types in the Request XPath definitions. See #4 for specification.
…gration which inserts a new Permission Specification and allocates to Administrator in Role Registry Permission. Created a page called EntityAnalysisModelWizard and included the validation of the new Permission Specification in code behind. Included the page location in the shared layout menu also on validation of the new Permission Specification. See #4 for specification.
Picking this issue back up after a period of inactivity. Some changes to the design will include categorical variables being laid out in the Inline Function rather than using Json XPath processing. This means that the original file format is fully respected as Json and the user does not need to worry about breaking out categorical variables on recall. It was too convoluted to achieve the categorical variable pivoting in the Request XPath, which is what the Inline Function is for anyway. |
Creating a machine learning model in Jube can be a convoluted process involving creating a model, specifying fields to be extracted, specifying tags and then loading data via HTTP endpoint, before being available for training in the embedded Exhaustive machine learning algorithm. The requirements contrast to products which can achieve the same through the application of a CSV file. It follows that despite having more advanced capabilities the adoption may be reduced to other products. While Jube was not designed as an automated machine learning Wizard, there appears increasing overlap
It is proposed that a Model Wizard be created to take a CSV file and parse the metadata and data itself, automatically creating all configuration elements that are otherwise created manually. The file will be parsed for its data to identify the universe of categorical variables, with these being created as Boolean XPath expressions (a process which currently is done typically outside of Jube).
Task: Ensure JSON Path Expression returns a Boolean value
As categorical data pivoting will be done in Jube, JSON Path must be available in the Request XPath Model Configuration to return based on Expression, for example, $.[?(@.=='Politician')].
Task: Create a new page to parse the CSV file
The new page called Model Wizard, existing under the Models menu item, will accept a CSV file as an upload and proceed to parse the headers. For each header the data will be inspected:
In keeping with the stateless nature of the design, the parsing will be stored in tables in the database for recall by the user interface. At this stage, the model will not be created.
Task: Allocate Dependent Variable
With the metadata having been established, the page must accept further configuration parameters, specifically including the dependent variable, which will go on to be a tag value, corresponding Exhaustive Model and Activation Rule.
Task: Create Model
Based on metadata and configuration create the model in Jube comprising:
Task: Load Data from CSV into JSON for storage in the Archive
Transpose the CSV file to a JSON representation and store it in the Archive table which will make the data available for Exhaustive training.
Task: Synchronise Model
Insert data to cause the model to synchronise and thus start Exhaustive training.
The text was updated successfully, but these errors were encountered: