
Receive models from emission eval private data #945

Conversation

humbleOldSage
Contributor

Moving all the model-related files from e-mission-eval-private-data to e-mission-server. The following four files are moved from TRB_label_assist to emission/analysis/modelling/trip_model:

  1. models.py
  2. clustering.py
  3. mapping.py
  4. data_wrangling.py

I'll link the PR that handles the changes on the e-mission-eval-private-data side below once it is ready; that will make it easier to track the changes on both sides.
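For illustration, the import rewrite that this move implies could be sketched as below. The regex helper and the `add_loc_clusters` name are hypothetical, not part of the actual migration:

```python
# Sketch: rewriting old TRB_label_assist-style imports to the new
# trip_model package path. Illustrative only, not the actual migration script.
import re

MOVED_MODULES = ["models", "clustering", "mapping", "data_wrangling"]
NEW_PKG = "emission.analysis.modelling.trip_model"

def rewrite_import(line: str) -> str:
    """Rewrite `import models` / `from models import X` style lines to the
    new emission.analysis.modelling.trip_model.* paths."""
    for mod in MOVED_MODULES:
        line = re.sub(rf"^import {mod}\b",
                      f"import {NEW_PKG}.{mod} as {mod}", line)
        line = re.sub(rf"^from {mod} import",
                      f"from {NEW_PKG}.{mod} import", line)
    return line

# `add_loc_clusters` is a hypothetical function name used for illustration.
print(rewrite_import("from clustering import add_loc_clusters"))
```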

shankari and others added 30 commits July 30, 2017 12:51
Evaluation for TRB 2017 paper
Percom analysis + adapt the notebook script to read config
Make the setup and teardown generic and update the README to reflect …
Also, since we are sourcing the base setup/teardown, they already operate in
the current directory.

No need for additional copy/remove
Have the teardown delete current conf, not the e-mission conf!
Create a new directory with new setup and teardown scripts
Add the setup and teardown directories for the tripaware paper
To ensure that people don't check in sensitive material
Currently, this is the same as the list for the e-mission server
* Bulk update of repo policies

* Create a new environment for the eval

which includes the visualization modules from emission
instead of polluting the emission environment directly

* Fix/modify the setup scripts

- add an activate option to quickly activate the environment instead of setting everything up
- change the setup code to install emission and the emission viz/notebook code
- ensure that we install/activate conda as well
This is largely a direct copy of the existing `graph` function from
`emission/analysis/modelling/tour_model/similarity.py` in
https://github.com/e-mission/e-mission-server.git

Minor modifications:
- move the matplotlib imports out
- create a figure (`fig = plt.figure()`) before the plot
- return the figure so it is displayed properly
- add a line indicating the cutoff point in the graph
    - change the color of the existing cutoff to be red
    - notice that it is not visible
    - add a line indicating the cutoff instead

Also add the scaffolding to read and analyse data before generating the graph
- read the data
- create a similarity object
- create the bins

Note that this uses the newly refactored `calc_cutoff_bins` method so we can
plot the graph *before* and after the uncommon trips are removed
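The modifications listed above can be sketched roughly as follows; the `distances` and `cutoff_index` inputs are stand-ins for what the real `graph` reads from the similarity object:

```python
# Sketch of the modified graph(): create the figure up front, draw a red
# cutoff line, and return the figure so notebooks display it properly.
# Input names are illustrative stand-ins for the similarity object's data.
import matplotlib
matplotlib.use("Agg")  # headless backend so this also runs outside notebooks
import matplotlib.pyplot as plt

def graph(distances, cutoff_index):
    fig = plt.figure()                 # create the figure before plotting
    ax = fig.add_subplot(1, 1, 1)
    ax.plot(sorted(distances, reverse=True))
    # The recolored cutoff marker was hard to see, so draw an explicit line.
    ax.axvline(cutoff_index, color="red", label="cutoff")
    ax.legend()
    return fig                         # returning it lets notebooks render it

fig = graph([5.0, 1.2, 9.3, 0.4, 7.1], cutoff_index=3)
```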
* plot graphs for all users

* plot graphs for all users

* update graph() function, adjust subplots size

* remove extraneous lines, only keep the code for plotting graphs for all users
Thanks to @corinne-hcr for finding and reporting them
hlu109 and others added 24 commits December 16, 2022 09:21
- Drop `fixed-width (O-D, destination)` since it is much worse than the others
  and we don't have the time to figure out why
- Add curve fitting for the f-score vs. number of trips, showing a curve that
  plateaus between 125 and 375 trips
…results

Note that these results can take very long (> 2 days) to regenerate
Running them from a notebook will either not print logs, or will print so many
logs that the notebook buffers will be overwhelmed

Moving the computation code out to a separate script allows us to more easily
redirect the output to a file and track the progress of the execution
- Replace PLACEHOLDERS with actual opcodes
- comment out knee detection from the classification performance since it didn't actually work that well
- add similar curve fitting to the cluster performance although we didn't use
  it in the paper
- initialize the predictors correctly (with strings)

Testing done:
- Ran all the notebooks, they ran without errors
* Covering up a possible error

Setting up the repository for the first time might cause this error to pop up. A solution was proposed earlier in the Teams chat; just migrating it here.

* Update README.md

Included a check to ensure that

* Update Clustering.py

Update clustering.py to link it to the main branch's trip_model rather than hlu109's tour_model_extended

* Revert "Update Clustering.py"

This reverts commit e90d5037d73d8504e7429b69fea8af13004c2013.

* Update Readme

Ensuring conf file copied to correct location

* Removed whitespace
* Update clustering.py

Changes in clustering.py to shift the dependency from hlu109's tour_model_extended to the main branch's trip_model. Still need to change the type of data being passed to the fit function for this to work.

* moving clustering_examples.ipynb to trip_model

All dependencies of this notebook on the custom branch are removed. There currently seem to be no errors while generating maps in the clustering_examples notebook.

* Removing changes in builtimeseries.py

With these changes, no change in e-mission-server should be required.

* Changes to support TRB_Label_Assist

Passing the clustering method to e-mission-server. It was 'origin-destination' by default; it can now take one of three values: 'origin', 'destination' or 'origin-destination'.
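As an illustration only (the function and parameter names below are assumptions, not the actual e-mission-server API), validating such a parameter might look like:

```python
# Illustrative validation for a clustering-method parameter with an
# 'origin-destination' default; names are hypothetical.
VALID_CLUSTER_METHODS = ("origin", "destination", "origin-destination")

def check_cluster_method(loc_type: str = "origin-destination") -> str:
    """Accept exactly one of the three supported clustering methods."""
    if loc_type not in VALID_CLUSTER_METHODS:
        raise ValueError(
            f"loc_type must be one of {VALID_CLUSTER_METHODS}, got {loc_type!r}")
    return loc_type

check_cluster_method()          # default stays 'origin-destination'
check_cluster_method("origin")  # explicit override
```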

* suggestions

Applied previous suggestions to improve readability.

* Revert "suggestions"

This reverts commit 3e19b32cd090135b001709cb52da57e6c6a17c1f.

* Improving readability

Suggestions from previous comments to improve readability.

* Making `cluster_performance.ipynb`, `generate_figs_for_poster` and `SVM_decision_boundaries` compatible with the changes in the `clustering.py` and `mapping.py` files, and porting these 3 notebooks to trip_model

`cluster_performance.ipynb`, `generate_figs_for_poster` and `SVM_decision_boundaries` now have no dependence on the custom branch. Plots are attached to show no difference between their previous and current outputs.

* Unified Interface for fit function

Unified interface for the fit function across all models. Passing 'Entry'-type data from the notebooks down to the binning functions. Default set to 'none'.
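One way to picture such a unified interface (a sketch under assumed names; the actual trip_model base class in e-mission-server may differ):

```python
# Sketch of a fit() signature shared across models, taking a list of
# Entry-like dicts (stand-ins for emission's Entry objects) down to binning.
# Class and attribute names here are illustrative assumptions.
from abc import ABC, abstractmethod
from typing import Optional

class TripModel(ABC):
    @abstractmethod
    def fit(self, trips: Optional[list] = None):
        """Every model accepts the same Entry-type trip list; default None."""

class GreedyBinningModel(TripModel):
    def fit(self, trips: Optional[list] = None):
        # Trivial stand-in for the real binning logic.
        self.bins = {} if trips is None else {0: trips}
        return self

model = GreedyBinningModel().fit([{"data": {"purpose": "work"}}])
```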

* Fixing `models.py` to support `regenerate_classification_performance_results.py`

Prior to this update, `NaiveBinningClassifier` in `models.py` had dependencies on both the tour model and the trip model. Now this classifier depends entirely on the trip model. All the other notebooks (except `classification_performance.ipynb`) were tested as well and work as usual.

 Other minor fixes to support previous changes.

* [PARTIALLY TESTED] Single database read and code cleanup

1. Removed mentions of `tour_model` or `tour_model_first_only`.

2. Removed two reads from the database.

3. Removed notebook outputs (this could be the reason a few diffs were too big to view).

* Delete TRB_label_assist/first_trial_results/cv results DBSCAN+SVM (destination).csv

not required.

* Reverting Notebook

Reverting the notebooks to their initial state, since running them in the browser messed up the cell index numbers. This was causing unnecessary git diffs even when no changes were made; running in VS Code should resolve this. Will make the subsequent changes in VS Code and commit again.

* [Partially Tested]Handled Whitespaces

Whitespaces corrected.

* [Partially Tested] Suggested changes implemented

`Classification_performance` and `regenerate_classification_performance_results.py` are not tested yet as they would take too long to run. The itertools removal in these two files is tested in other notebooks and works. Other files, like models.py, will be tested once either of the above two is run.

* Revert "[Partially Tested] Suggested changes implemented"

This reverts commit bb404e989b2826f159e88fa828537b24785508e3.

* [Partially Tested] Suggested changes implemented

`Classification_performance` and `regenerate_classification_performance_results.py` are not tested yet as they would take too long to run. The itertools removal in these two files is tested in other notebooks and it works. Other files, like models.py will be tested once any of the above two are run.

* Minor variable fixes

Fixed names of variables to be more self-explanatory

* [TESTED] All the notebooks and files are tested

1. Changes in the models file according to the changes in greedy_similarity_binning in e-mission-server

2. Minor fixes

* Minor Fixes

Minor Fixes to improve readability.

* Minor Fixes in models.py

Improved readability
…rver' into receive-models-from-emission-eval-private-data
Removing additional files that came with the model.
Removing unnecessary files
Removing files
Updating import paths and dependencies among the four files (mapping.py, clustering.py, models.py, data_wrangling.py) that were recently moved from e-mission-eval-private-data
@humbleOldSage
Contributor Author

corresponding PR on e-mission-eval-private-data is e-mission/e-mission-eval-private-data#40

@shankari
Contributor

shankari commented Dec 2, 2023

@humbleOldSage this has extraneous commits as well, including commits that are completely unrelated to this change (e.g. "check in percom analysis before we forget"). We should copy these files over with only the commit history related to them, not the entire commit history of the repository.
