1- Use Github
2- This project will have many phases:
1- script 2- website: subtitle search, check if already exist and quizlet creation 3- graph on the website 4- script apply to many subtitle languages
Today python script:
1- srt substitute file >>> remove the time stamps create 2 files : list of text subtitle + list of "description information" example
2- from the "subtitles words only "
1- count all the words ( summary )
2- count only the unique words ( provide stat of words / subtitle text + unique words summary )
3- More advance: count only the same word for: masculin/feminim, various tense verbs
3- create a list + translation
1- translate the words, order based on the count per subtitle
2- more advance:
+ add le letter "V" ( verb) and add the infinitive form of the verb
+ add letter "N" ( noums ) ,.... >>> for the Noums add " el" or "la"
+ may be other . . . ????
Future:
1- create an website to search / download subtitle file ( srt )
2- run the precedent script
3- create an Quizlet ( https://quizlet.com/ )
3- compare list of file >>> define the deta >> create only the new words