Lego
Original and NeuGen images of the class "lego".
Neural Radiance Fields (NeRF) have significantly advanced the field of novel view synthesis, yet their generalization across diverse scenes and conditions remains challenging. Addressing this, we propose the integration of a novel brain-inspired normalization technique Neural Generalization (NeuGen) into leading NeRF architectures which include MVSNeRF and GeoNeRF. NeuGen extracts domain-invariant features, thereby enhancing the models' generalization capabilities. Our methodology involves fine-tuning pre-trained NeRF models with the incorporation of NeuGen, which facilitates the construction of a more informative feature set that leads to an accurate and robust rendering of images. Through this integration, NeuGen shows benchmarking performance on diverse datasets across state-of-the-art NeRF architectures, enabling them to generalize better across varied scenes. Our comprehensive evaluations, both quantitative and qualitative, confirm that our approach not only surpasses existing models in generalizability but also markedly improves rendering quality. Our work exemplifies the potential of merging neuroscientific principles with deep learning frameworks, setting a new precedent for enhanced generalizability and efficiency in novel view synthesis.
Our pipeline begins with the original images (denoted as I), which are segmented into patches. These patches are then sequentially processed through NeuGen layers, as depicted in the associated equations. The collective output from all sections of the NeuGen layer is what we refer to as a NeuGen image (I^G), an idea inspired by the mammalian visual cortex's method of interpreting scenes. Subsequently, we combine these NeuGen images with the original ones to create what we term NeuGen-enhanced images (I^E). This enhancement plays a crucial role in improving feature extraction, which is vital for the input to Neural Radiance Fields (NeRF) models. The diagram's right side illustrates the volumetric rendering step, a critical component common to all NeRF methodologies, including well-known ones like MVSNeRF and GeoNeRF. It's important to recognize that while this process is fundamental, the specific details and feature volume implementations are uniquely adapted in each architecture to suit their distinct requirements.
Original and NeuGen images of the class "lego".
Original and NeuGen images of the class "drums".
[1] Chen, A., Xu, Z., Zhao, F., Zhang, X., Xiang, F., Yu, J., & Su, H. (2021).
MVSNeRF: Fast Generalizable Radiance Field Reconstruction from Multi-View Stereo.
arXiv preprint arXiv:2103.15595.
[2] Johari, M. M., Lepoittevin, Y., & Fleuret, F. (2022). GeoNeRF: Generalizing NeRF with Geometry Priors.
arXiv preprint arXiv:2111.13539.