Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

So, just to make it clear, the main difference in this paper is adding a small amount of noise to each update? I'm a little frustrated that I read through the whole paper and still am not sure about this.


No the main difference is looking at the problem from a new perspective, relating it to a large body of existing work in statistics (see https://en.wikipedia.org/wiki/Markov_chain_Monte_Carlo and https://en.wikipedia.org/wiki/Stochastic_gradient_Langevin_d...). Then they are able to use this new perspective to add several improvements that lead to significant quality improvements, clearly establishing the validity and utility of the new theoretical underpinnings. It's likely this will be massively influential in the direction of future research in the space.

The actual changes this led to are:

1) As you mentioned, they added noise. But notably, they state it was "designed carefully" to conform to the requirements of SGLD and they detail how they designed the noise.

2) They simplified the original operations of "move, split, clone, prune, and add" and their related heuristics, into a single operation type. They do so guided by existing knowledge about MCMC frameworks, leading to a simpler model with stronger theoretical underpinnings (huge win!).

3) Adjustments to how gaussians are added and pruned to better fit with the new model. This seems more like housekeeping rather than something novel in and of itself.


Was the noise added to the parameters or to the input images? I assume the parameters, so that they could be fine tuned via the MCMC.


I'm not an expert at all, but my understanding is that Gaussian splatting is essentially a rendering technique. Normally, you'd take actual data, like a set of photos, and optimize some Gaussians against it to arrive at a volumetric representation. In the case of AI-generated splats, it's kinda flipped. Instead of optimizing against a known ground truth, the AI is generating Gaussians to be rendered. The insight of this paper is that we already have great statistical tools for numerically estimating ground truth based on a bunch of Gaussians, so why not just apply those?


No, Markov Chain Monte Carlo to get final stated from distribution which is hard to tackle any other way




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: