
Modify depth dimensions to match our input #2

Open · Santoi wants to merge 4 commits into main

Conversation

@Santoi (Collaborator) commented Aug 6, 2024:

The original application is intended to run while the NeRF input is being captured, or while the input is transmitted with DSS.

This PR makes the tweaks required to use previously recorded datasets in the following format:

dataset-dir
|
|--- depth
|    |
|    `--- 0.png
|--- rgb
|    |
|    `--- 0.png
|
`--- transforms.json

Changes:

  • The depth data arrives with its channels split into 3 separate vectors, but the algorithm expects a single vector. Flattening the last dimension solves this by joining those 3 vectors into one (see the sketch after this list).

  • When the depth mask is reused as the color_mask, the code expects it to have 1 channel instead of 3, so the tiling is adapted for 3 channels. The depth image is grayscale, so it could have been represented with a single channel in the first place.

  • Add the option of generating a point-cloud output with the visualization script.
    WARNING: the visualization script freezes if run without a GUI.
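
A minimal sketch of the reshaping involved (not the PR diff verbatim; the (H, W, 3) layout, shapes, and dtype are assumptions for illustration):

```python
import torch

# Assume the grayscale depth PNG decodes to an (H, W, 3) tensor whose
# three channels are identical.
H, W = 480, 640
depth = torch.randint(0, 2**16, (H, W, 3), dtype=torch.int32)

# Flatten the channel-separated values into the single vector the
# point-cloud code expects.
depth_flat = depth.flatten()                # shape (H * W * 3,)

# Build the validity mask from one channel only (all three are equal),
# then tile it where a 3-channel color_mask is expected.
mask = (depth[..., 0] > 0).reshape(-1)      # shape (H * W,)
color_mask = torch.tile(mask, (3, 1))       # shape (3, H * W)
```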

@Santoi requested review from olmerg and hidmic, August 6, 2024 13:44
@hidmic (Collaborator) left a comment:

Mind explaining the flattening and tiling that is going on?

On the line `opencv-python`:
Collaborator:

@Santoi a missing dependency? If opencv is not used for visualization, consider pulling in opencv-python-headless instead. opencv-python and matplotlib don't play well together in certain cases (due to PyQt compatibility issues).
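
A sketch of the suggested swap, assuming OpenCV is only needed for image I/O and a standard requirements.txt:

```
# Use the headless wheel when OpenCV is only needed for image I/O;
# it avoids the PyQt clash with matplotlib.
opencv-python-headless
```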

Collaborator Author:

Got it, thanks!

@@ -107,9 +107,9 @@ def get_pointcloud(color, depth, intrinsics, w2c, transform_pts=True,

     # Select points based on mask
     if mask is not None:
-        point_cld = point_cld[mask]
+        # point_cld = point_cld[mask]
Collaborator:

@Santoi why comment this out?

Collaborator Author:

Two reasons, neither necessarily valid:

  1. It was causing a tensor dimension mismatch I hadn't been able to solve.
    This is now addressed in a new commit by generating the mask from a single channel instead of all of them.
  2. It didn't seem necessary, since the mask is built from valid depth values and every depth value we had was valid (greater than 0). Still, the masking is applied again just in case invalid values do appear (see the sketch below).
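
A minimal sketch of the single-channel fix (shapes and names are illustrative, not the actual diff):

```python
import torch

H, W = 480, 640
depth = torch.rand(H, W, 3)              # stand-in depth, channels identical
point_cld = torch.rand(H * W, 6)         # (num_points, xyz + rgb)

# Build the mask from a single channel so it matches the point cloud's
# first dimension, then re-enable the masking.
mask = (depth[..., 0] > 0).reshape(-1)   # shape (H * W,)
point_cld = point_cld[mask]              # no dimension mismatch now
```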

Collaborator:

Still don't understand what's going on here. How is it that depth maps have 3 channels?

Collaborator Author:

The capturing app saves the depth as grayscale PNG images, so each pixel is represented with red, green, and blue channels that all hold the same value.
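
For instance, loading one of these PNGs shows the redundant channels (a sketch; the path and shapes are illustrative):

```python
import cv2

img = cv2.imread("dataset-dir/depth/0.png", cv2.IMREAD_UNCHANGED)
print(img.shape)        # e.g. (480, 640, 3): grayscale stored as 3 channels
depth = img[:, :, 0]    # all channels hold the same value, so keep one
```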

@hidmic (Collaborator) commented Aug 14, 2024:

Hmm, that is unusual. Depth maps usually use single-channel, unsigned-integer pixels. How is the app converting depth to that grayscale? Now I wonder whether we are losing information in this conversion.
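
One way to check for precision loss (a sketch; the file path is assumed): if the PNG decodes to 8-bit channels, depth is quantized to 256 levels, whereas a single-channel 16-bit PNG would keep 65536.

```python
import cv2
import numpy as np

img = cv2.imread("dataset-dir/depth/0.png", cv2.IMREAD_UNCHANGED)
print(img.dtype)                                 # uint8 would mean only 256 depth levels
assert np.array_equal(img[..., 0], img[..., 1])  # channels should be identical
assert np.array_equal(img[..., 1], img[..., 2])
```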

Collaborator Author:

The multi-channel depth map is only produced by NeRF Capture's "offline mode", which currently has some issues:

  • On one hand, it hasn't been tested by the SplaTAM authors, so it probably won't work right out of the box.
  • On the other, it seems to be simply broken; see comment.

In the meantime, I will try the suggestions from this issue and see how it goes: spla-tam#59 (comment)

@Santoi requested a review from hidmic, August 13, 2024 20:40
@Santoi (Collaborator Author) commented Aug 13, 2024:

Hi, @hidmic ! Thanks for the review!

PR description has been updated to better explain the changes. PTAL.
