Tasks

Tasks

1. First milestone
The tasks we want to accomplish for the first milestone are:
 1. Parse the XML files, extract the words to obtain raw text
 2. Do a word count, and store the output as CSV (format: word, #occ, year)
 3. Compute the word temporal profiles (first as a List then as a graph)

Repartition of the work:

2. Second milestone
To cluster the word temporal profiles, we will look into the following:
1. Fourier transform
2. Machine Learning/Artificial Intelligence
3. Time series
 

[To complete]

3. Third milestone [To complete]