This repository provides an introduction to tokenization in SpaCy with practical examples. The code demonstrates how to:
- Tokenize text using SpaCy.
- Access and explore token attributes.
- Extract specific entities like email addresses and URLs.
- Customize tokenization rules.
- Segment text into sentences.
- Perform exercises to extract data from text.
- Clone the repository.
- Install the required packages using pip:
pip install spacy