Student's name | SCIPER |
---|---|
Pavlo Karalupov | 293393 |
Erdem Böcügöz | 295097 |
Mammad Hajili | 294808 |
Milestone 1 • Milestone 2 • Milestone 3
10% of the final grade
Find a dataset (or multiple) that you will explore. Assess the quality of the data it contains and how much preprocessing / data-cleaning it will require before tackling visualization. We recommend using a standard dataset as this course is not about scraping nor data processing.
The dataset we are using is from Kaggle, authored by originally Paul Mooney. It is about a famous fictional novel by J.R.R.Tolkien - The Lord of the Rings, and a film series based on the novel. The data originally contains two datasets - first one - lotr_characters.csv which is about the characters in the novel, and the second one - lotr_scripts.csv, the scripts from three films.
First dataset contains the following features of characters: Birth, Death, Gender, Hair, Height, Name, Race, Realm, Spouse Second dataset contains dialogs from movies and has 3 columns: Char, Dialog, Movie
Additionally, in the context of LOTR, we have found another interesting source which includes timeline of events in LOTR world. This source provides multiple tables, each corresponds to one of the six ages - "Years of the Lamps", "Years of the Trees", and First, Second, Third, and Fourth Age. We have scraped the information from the tables using BeautifulSoup and build six datasets. The structure of the datasets are almost the same and each has two main columns: Year, Event.
Frame the general topic of your visualization and the main axis that you want to develop. What am I trying to show with my visualization? Think of an overview for the project, your motivation, and the target audience.
Our project will illustrate the lifespan of the characters through different Middle-Earth ages, their interactions through the dialogues, analysis of their roles in each movie such as general sentiment, relations with events, and with each other's.
The main axis we would like to develop are the following:
- What are key characters, and how is their development through time starting from first age to the last?
- How is the link between main characters in movies? How their races, physical appearances and realms are important and relevant?
- How is the evolution of the events through time, what are their affects, and consequences in the novel, and in the context of films?
Please check out Jupyter Notebook for analysis.
- There are some works that allready done some analysis on the data and visualized the results. For example in here you can find very detailed analysis of the same data. Also here you can find a more complex work on a similiar data.
- Our approach will be focused on interesting data analysis and interactive visualization simulteaniously. This way we beleive that we can tell a story from the data and not just show the statistical analysis. Also, as far as we found there is a lack of interactivity in previous works.
- We are inspired by one of the last year's projects that visualized a comic book dataset. You can find the project here. We thought that we can apply similiar logic to a dataset related to Lord of The Rings. Of course, since the data and content is different, in the end they will be two seperate works.
10% of the final grade For this milestone we used Vue, typescript(mainly d3) and vue-bootstrap for our website. We set up build pipeline using npm, webpack and vue-cli, configured linting to follow best practices in code style. We made 3 main charts: chord diagram, map and timeline. But, note that the website is incomplete and just the skeleton of our future work. Below you can find our report and website.
Technical Setup
For project:- vue-cli need to be installed
npm install
npm run serve
npm run build
npm run lint
npm run deploy
80% of the final grade