Describe the issue
Hi there,
Thank you for sharing this repository and your work!
I am trying to run the LeanK fine-tuning code, but downloading the booksum dataset fails. The link used in the code (or documentation) appears to be no longer valid.
The broken link:
https://huggingface.co/datasets/togethercomputer/Long-Data-Collections/resolve/main/fine-tune/booksum.jsonl.zst
Code location: hip-research/src/hip_research/dataset/booksum.py
Accessing this URL returns a 404 error (or a message indicating that the file is missing).
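In case it helps with reproducing this outside the repository, here is a minimal sketch of how the link can be checked using only the Python standard library. The helper name `http_status` is my own and is not part of hip-research; the `opener` parameter exists only so the function can be exercised without network access.

```python
import urllib.error
import urllib.request

# The dataset URL quoted in this report.
URL = (
    "https://huggingface.co/datasets/togethercomputer/Long-Data-Collections"
    "/resolve/main/fine-tune/booksum.jsonl.zst"
)


def http_status(url, opener=urllib.request.urlopen):
    """Return the HTTP status code obtained for `url`.

    Hypothetical helper for illustration. It starts with a HEAD request so
    that the file body is not downloaded when the link does resolve.
    """
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=30) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises HTTPError for 4xx/5xx responses; the status code
        # is available on the exception object.
        return err.code


# Usage (requires network access):
# print(http_status(URL))
```

A network failure such as a DNS error or timeout is raised as `urllib.error.URLError` and is deliberately not caught here, so it is not mistaken for a missing file.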
Could you please check if the dataset has been moved to a new location or provide an alternative link to download it?
Thank you for your time and help!
Best regards,