I’m using Windows 11 with Tesseract v5.4.0.
My goal is to add a “Handwriting” font to the Hebrew language.
I succeeded in creating the “tif” and “box” files, and then the “tr” file,
but not in creating the traineddata. I’m getting an error:
Loaded file heb_hw_lstm/.tr, unpacking... Failed to read continue from: heb_hw_lstm/.tr
Notes:
- I’m using the “best” version, not the “fast” one
  (“/tessdata” contains the “best” variant as “heb.traineddata”).
- I’m using langdata_lstm.
- For POC purposes, max_pages is set to only 2.
Any help would be appreciated.
My script:
#
text2image --text="langdata_lstm/heb/heb.training_text" --outputbase="output/2" --font="Handwriting Regular" --D="output" --fonts_dir="fonts" --max_pages="2"
#
tesseract "output/2.tif" "output/2" -l heb box.train.stderr
#
lstmtraining --stop_training --continue_from="output/2.tr" --traineddata="tessdata/heb.traineddata" --model_output="output/2.traineddata"
Loaded file output/2.tr, unpacking... Failed to read continue from: output/2.tr