This project entails gathering a wide range of Arabic text documents from diverse sources, including literary works, news articles, and user-generated content. These texts are then meticulously annotated to facilitate deeper language understanding and model training.