-
Notifications
You must be signed in to change notification settings - Fork 321
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to pinpoint problematic table columns when getting metadata from multiple tables #2323
Comments
Ah, I had a typo in the python snippet in the print statement ( So my first question is resolved. The second question/feature request is whether there's a way to include the table name and column name in the trace for easier debugging. |
Hi @yimingli, thanks for filing this issue. It seems like you have resolved your overall issue of figuring out which columns were problematic? We can certainly track a feature request for a better debugging experience. Before we are able to make an update, it would be helpful to better understand what is going on -- as I have never before seen a case where the column labels (aka column names) are dictionaries instead of strings. Would you be able to explain more about your original data format? How did you load the data into Python to create the |
Not the column names, but column values are dictionaries. Columns names are strings. Does this help? |
Hi @yimingli, thank you for clarifying this. I filed a feature request #2327 for surfacing the name of the column/table to you when the metadata detection crashes, so you won't have this problem next time. In the meantime, we're always looking for feedback to support your use case. I'm wondering if SDV offering official support for dictionary values in your data would be a useful feature for you? If so, I'm curious as to what kind of info is being stored in dictionary format. Do all the dictionaries in each of the data cells have the same key/value pairs? Any more info you can provide would be helpful. Thanks. |
Environment details
If you are already running SDV, please indicate the following details about the environment in
which you are running it:
Problem description
When running
metadata = Metadata.detect_from_dataframes(data=real_data)
, got errorunhashable type: 'dict'
.What I already tried
I thought the error indicates that some dataframe columns are of the 'dict' type, so I used the following snippet to check, and none of the columns are dicts.
I have 2 questions:
unhashable type: 'dict'
mean?Thank you!
Trace:
The text was updated successfully, but these errors were encountered: