We embarked on a rigorous process to gather and annotate a vast array of Hindi text files, spanning multiple genres and styles. This included literary works, news articles, and conversational scripts, ensuring a rich and varied dataset that reflects the complexity and nuances of the Hindi language.