We’re proud to open-source LIDARLearn [R] [D] [P]

Reddit r/MachineLearning Tools

Summary

LIDARLearn is an open-source PyTorch library for 3D point cloud deep learning that unifies 56 pre-configured models with built-in cross-validation and automatic publication-ready LaTeX report generation. The framework supports supervised, self-supervised, and parameter-efficient fine-tuning methods across datasets like ModelNet40, ShapeNet, and remote sensing benchmarks.

It’s a unified PyTorch library for 3D point cloud deep learning. To our knowledge, it’s the first framework that supports such a large collection of models in one place, with built-in cross-validation support. It brings together 56 ready-to-use configurations covering supervised, self-supervised, and parameter-efficient fine-tuning methods. You can run everything from a single YAML file with one simple command. One of the best features: after training, you can automatically generate a publication-ready LaTeX PDF. It creates clean tables, highlights the best results, and runs statistical tests and diagrams for you. No need to build tables manually in Overleaf. The library includes benchmarks on datasets like ModelNet40, ShapeNet, S3DIS, and two remote sensing datasets (STPCTLS and HELIALS). STPCTLS is already preprocessed, so you can use it right away. This project is intended for researchers in 3D point cloud learning, 3D computer vision, and remote sensing. Paper 📄: [https://arxiv.org/abs/2604.10780](https://arxiv.org/abs/2604.10780) It’s released under the MIT license. Contributions and benchmarks are welcome! GitHub 💻: [https://github.com/said-ohamouddou/LIDARLearn](https://github.com/said-ohamouddou/LIDARLearn)
Original Article

Similar Articles

Lite3R: A Model-Agnostic Framework for Efficient Feed-Forward 3D Reconstruction

Hugging Face Daily Papers

Lite3R is a model-agnostic framework that improves the efficiency of transformer-based 3D reconstruction using sparse linear attention and FP8-aware quantization. It reduces latency and memory usage by up to 2.4x while maintaining geometric accuracy on backbones like VGGT and DA3-Large.

microsoft/Lens-Turbo

Hugging Face Models Trending

Microsoft releases Lens, a 3.8B-parameter foundational text-to-image model with efficient training and fast high-resolution generation, featuring dense-caption pre-training and mixed-resolution learning.