<!-- NOTE: For questions or install related issues, please open a Discussion instead. --> ## How to reproduce the behaviour ``` import spacy nlp = spacy.load("ja_core_news_md") text = "詳細はこちらのサイトをご覧ください:https://example.com/jp" doc = nlp(text) for token in doc: print(token.text) ``` Output: ``` 詳細 は こちら の サイト を ご覧 ください : https : / / example . com / jp ``` ## Your Environment - **spaCy version:** 3.8.11 - **Platform:** Linux-6.16.1-arch1-1-x86_64-with-glibc2.42 - **Python version:** 3.12.10 - **Pipelines:** ja_core_news_md (3.8.0), it_core_news_md (3.8.0), pt_core_news_md (3.8.0), en_core_web_trf (3.8.0), en_core_web_sm (3.8.0), nl_core_news_md (3.8.0), en_core_web_md (3.8.0), es_core_news_md (3.8.0), fr_core_news_md (3.8.0), de_core_news_md (3.8.0)
How to reproduce the behaviour
Output:
Your Environment