The datasets collected in our work are introduced below. For more detailed information, please refer to my Scholar page.
Persian Morphologically Segmented Lexicon 0.5. Free
This dataset includes 45,300 Persian word forms that have been manually segmented into sequences of morphemes. Lemmas and some additional information about these words are also included.