Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »


1- Use Github

2- documentation ( How-to use )

3- Use OpenSource/freeware/shared script/program when possible

4- This project will have many phases:   

phase 1- python script  ( create a word list and translate )
phase 2- website: subtitle search, check if already exist and create a Quizlet 
phase 3- script apply to many subtitle languages / TV series with many episode 
phase 4- graph on the website ( statistics of words usage per single subtitle files and multiple subtitle files )  
phase 5- Integrate learning progress from Quizlet ( or other ) to create new list of words/ per subtitle files


Today python script:  Phase 1

1- srt substitute file >>> remove the time stamps create 2 files : list of text subtitle + list of "description information" example 

2- from the "subtitles words only "

1- count all the words  ( summary ) 

2- count only the unique words ( provide stat of words / subtitle text + unique words summary )

3- More advance: count only the same word for: masculin/feminim, also "la + noums" /"el + noums" = 1 words ,  various tense verbs = 1 words 

3- create a list + translation

1- translate the words, order based on the count per subtitle

2- more advance: 

+ add the letter "V"  ( verb)  and add the infinitive form of the verb 

+ add the letter "N" ( noums ) ,.... >>> for the Noums add " el" or "la"

+ may be other . . . ???? 



Future:

1- create an website to search / download subtitle file ( srt )

2- run the precedent script

3- create an Quizlet ( https://quizlet.com/ )

3- compare list of file >>> define the delta/diff  >> create only the new words 




  • No labels