-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Segmentation Fault w/ transform_jsonl() #23
Comments
Does the example here work? |
It seems as though when I run the script everything works as intended; however, when I try to access via Python, I run into the seg faults. I've just tried running the test_transform_jsonl.py and got the same error. |
Thanks @StephenTemp for reporting this. Could you please share the version of Python and version of torch you have installed? Also what operating system are you on? I would also be curious to know if the
To run that you'll need to create the I'd like to isolate whether it's the downloading and processing of images (which happens in both |
Thanks, @StephenTemp . I had a typo in Barack (missed the c) and as a result the example we tried didn't include an image. Could you re-run the test with
Or re-run with any profile that includes a picture. Sorry about that. |
Thanks, @StephenTemp . One more question is the version of PIL you have
My best guess @zijwang is that this is somewhere in the download or resizing of images given the code runs on a profile with no image but fails on a profile with an image. Given it's a seg fault, it must be in compiled code, which leads me to think it's something with PIL or how we pass the image data to PIL in a m3inference/m3inference/preprocess.py Lines 22 to 55 in bdaf727
I'll see if I can recreate on my OS (Linux) using the same versions. |
PIL: 8.2.0, thanks for your help! |
@StephenTemp I did a test run on 1) a valid screen name with a profile, 2) a valid screen name without a profile, and 3) a non-exist screen name with an Ubuntu machine and everything works just fine. Here is my
I also tried PIL 8.2.0 and things are still working fine. One thing you may try is to add more log prints to |
@StephenTemp -- checking in to see whether this issue has been resolved :) |
Looks like this could be the same issue as #26 It might be specially related to the arm64/M1 chipset, which would explain why we couldn't reproduce it |
Apologies for the wait; it's been a cluttered semester! Yes, unfortunately I couldn't work around the transform_json() function but was able to run the inference itself. It seems that the failure occurs in scripts/m3twitter.py on line |
I believe I've installed m3-inference correctly, but running transform_jsonl() on a json lines file of tweets seems to fetch the first profile picture in the list and then terminate with a segmentation fault.
I believe the file is structured appropriately, in the format below:
{json object}\n
{json object}\n
...
Any idea what I might be running into?
The text was updated successfully, but these errors were encountered: