Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LLM inference on Lunar Lake 258v causes system reboot #1435

Open
endomorphosis opened this issue Dec 25, 2024 · 4 comments
Open

LLM inference on Lunar Lake 258v causes system reboot #1435

endomorphosis opened this issue Dec 25, 2024 · 4 comments
Assignees

Comments

@endomorphosis
Copy link

it did not appear to have anything to do with running out of system ram, I the only difference between the vanilla implementation from the examples list, and the code that I wrote in python that causes the system crash on the ov_model.generate() function, is that I have also have cuda dependencies, because I am writing some code that auto-loads models regardless of hardware platform and model architecture, and multiplexes the inference endpoints from api providers.

https://github.com/endomorphosis/ipfs_accelerate_py/blob/a1cb9ca8e0d8623bf8ddc66daed350ff2cf27dfd/ipfs_accelerate_py/worker/skillset/hf_llava.py#L325

@endomorphosis endomorphosis changed the title LLM inference on Lunar Lake 258v causes system causes system reboot LLM inference on Lunar Lake 258v causes system reboot Dec 25, 2024
@endomorphosis
Copy link
Author

The computer has rebooted from a bugcheck. The bugcheck was: 0x00000124 (0x0000000000000000, 0xffff830fc8602028, 0x00000000b2000000, 0x0000000008210402). A dump was saved in: C:\windows\MEMORY.DMP. Report Id: 3599d5d5-5f01-4290-8f01-34ac5c70de4f.

https://huggingface.co/datasets/endomorphosis/LunarLake_Crash_MemDump/resolve/main/MEMORY.DMP

@Wan-Intel
Copy link

Could you please share the following information with us to further investigate the issue?

  • Python version
  • OpenVINO™ GenAI version
  • Hardware specifications
  • Host Operating System
  • List any steps we should take to reproduce the error you are seeing
  • Additional environment information
  • If Other Deep Learning Framework, please specify
  • If applicable, Deep Learning Framework used

@endomorphosis
Copy link
Author

image
Can you please download this file so that I can remove it from huggingface, given that the memory dump has (most likely) intel github credentials in it.

@endomorphosis
Copy link
Author

Could you please share the following information with us to further investigate the issue?

  • Python version
    3.10.11
  • OpenVINO™ GenAI version
    2024.6
  • Hardware specifications
    Lunar Lake PC, on the Tiber Devcloud
  • Host Operating System
    Windows 11
  • List any steps we should take to reproduce the error you are seeing

I have provided the code where the breakpoint should be placed.

https://github.com/endomorphosis/ipfs_accelerate_py/blob/a1cb9ca8e0d8623bf8ddc66daed350ff2cf27dfd/ipfs_accelerate_py/worker/skillset/hf_llava.py#L325

Here is the entrypoint to that code.

https://github.com/endomorphosis/ipfs_accelerate_py/blob/a1cb9ca8e0d8623bf8ddc66daed350ff2cf27dfd/ipfs_accelerate_py/ipfs_accelerate.py#L1627

The code occurs intermittently once every 3 or 4 runs.

  • Additional environment information
    Memory dump information was provided.

#1435 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants