Dataset

The datasets collected in our works are introduced at the following. For more detailed info you can refer to my Scholar page.

E Ansari, Z Žabokrtský, H Haghdoost, M Nikravesh

Persian Morphologically Segmented Lexicon 0.5. Free

This dataset includes 45300 Persian word forms which are manually segmented into sequences of morphemes. Lemmas and some extra information about those words are also included.