Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generalize to Nuscenes is not good #26

Open
SiYLin opened this issue Sep 14, 2023 · 10 comments
Open

Generalize to Nuscenes is not good #26

SiYLin opened this issue Sep 14, 2023 · 10 comments

Comments

@SiYLin
Copy link

SiYLin commented Sep 14, 2023

Hi, Thx for the great work. I recently have tried to apply your work on various autonomous driving dataset and find out the performance is way lower compare to the information you gave in paper such as KITTI, under same max init error. E.g. The position XY 5m recall is only around 40% under 32 init error.

@sarlinpe
Copy link
Collaborator

  1. Do you properly gravity-rectify and undistort the images? Do the gravity estimates look sufficiently accurate?
  2. Are you sure that the ground truth poses of these datasets are sufficiently accurate for such evaluation?

@SiYLin
Copy link
Author

SiYLin commented Sep 15, 2023

Thank you for your response. I appreciate your diligence in this matter. Upon my initial interpretation, I may have made a mistake.

I conducted tests on the KITTI dataset and the results aligned well with the data provided in your paper. However, when I applied the same tests to the Nuscenes dataset, I observed a significant drop in the XY position (5m), which fell to 48% recall under a maximum initial error of 32m.

I have verified the roll pitch that we used to rectify the camera (similar to the process what you used in KITTI dataset, pixel to camera, camera to vehicle, and vehicle to world). For my training, I only use the front camera data.

@sarlinpe
Copy link
Collaborator

sarlinpe commented Sep 15, 2023

  1. So you actually do retrain the model on the Nuscenes dataset? Do you initialize it with the model trained on Mapillary? How large is the dataset? Are you sure that it is not simply overfitting?
  2. I am not familiar with the Nuscenes dataset, but it may simply be more difficult than KITTI if it has sparser distinctive semantic elements (more data from highways, less from city & residential areas) or if OSM is of lower quality in these areas (incorrect or fewer labels).

@SiYLin
Copy link
Author

SiYLin commented Sep 15, 2023

  1. Yes, I retrained on Nuscenes.
  2. No,I trained model from scratch. The nuscenes dataset containes around 30K training samples and 6k validation samples.

@sarlinpe
Copy link
Collaborator

This is likely too small. If the training and validation data are in disjoint areas, you should see a clear overfitting.

@SiYLin
Copy link
Author

SiYLin commented Sep 16, 2023

Thx for your replying! It does overfit the training dataset. The recall half meters is 100% in training dataset....May I know how much data in general to train this model?

@sarlinpe
Copy link
Collaborator

The more the better. Try initializing your model with the pre-trained Mapillary model that we provide.

@jike5
Copy link

jike5 commented Feb 22, 2024

Hi, Thx for the great work. I recently have tried to apply your work on various autonomous driving dataset and find out the performance is way lower compare to the information you gave in paper such as KITTI, under same max init error. E.g. The position XY 5m recall is only around 40% under 32 init error.

Hello, SiYlin, I've also been trying to migrate Orienternet to NuScenes recently. I wanted to ask you how you're getting GPS information. As far as I know, NuScenes only provides ground truth pose and doesn't include raw GPS information with noise. Thank you very much!

@SiYLin
Copy link
Author

SiYLin commented Feb 22, 2024

Hi, Thx for the great work. I recently have tried to apply your work on various autonomous driving dataset and find out the performance is way lower compare to the information you gave in paper such as KITTI, under same max init error. E.g. The position XY 5m recall is only around 40% under 32 init error.

Hello, SiYlin, I've also been trying to migrate Orienternet to NuScenes recently. I wanted to ask you how you're getting GPS information. As far as I know, NuScenes only provides ground truth pose and doesn't include raw GPS information with noise. Thank you very much!

Hi :
You can try to add some random noise on the ground truth pose to fake the scenario in which you have noise GPS information.

@jike5
Copy link

jike5 commented Feb 22, 2024

Hi, Thx for the great work. I recently have tried to apply your work on various autonomous driving dataset and find out the performance is way lower compare to the information you gave in paper such as KITTI, under same max init error. E.g. The position XY 5m recall is only around 40% under 32 init error.

Hello, SiYlin, I've also been trying to migrate Orienternet to NuScenes recently. I wanted to ask you how you're getting GPS information. As far as I know, NuScenes only provides ground truth pose and doesn't include raw GPS information with noise. Thank you very much!

Hi : You can try to add some random noise on the ground truth pose to fake the scenario in which you have noise GPS information.

Got it, that's exactly what I'm doing now🤣, thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants