ParkingWorld
ParkingWorld system overview

3DGS Simulation + Human Experience + End-to-End RL

ParkingWorld

End-to-End Autonomous Parking Reinforcement Learning from Human Experience in 3DGS Simulation

Zhengcheng Yu, Changze Li, Haoran Liu, Tong Qin

0%

Simulation PSR

0%

Real-Vehicle PSR

0

Standard Slots

What It Solves

Learning to park from corrected mistakes

ParkingWorld trains an end-to-end autonomous parking policy inside photorealistic 3D Gaussian Splatting parking scenes. Instead of relying only on ordinary RL rollouts or large expert datasets, the system couples what the autonomous policy did wrong with how humans corrected the failure. This correction-in-the-loop replay design makes training more sample-efficient, safer, and more transferable to real vehicles.

Photorealistic 3DGS scenes BEV perception-policy network Rollback correction replay Real Changan CS55 deployment

Framework

Closed-loop training inside reconstructed parking worlds

Interactive 3DGS autonomous parking simulator
Interactive ROS-based 3DGS simulator with rendering, dynamics, collision checking, rollback recording, and human-in-the-loop training.

CIL-SERL

Correction-in-the-loop sample-efficient RL

Human interventions are not simply mixed into a single demonstration buffer. ParkingWorld stores normal RL rollouts, human takeover segments, failed autonomous trajectories, and corrected rollback segments in coupled replay buffers. The policy therefore learns both the failure context and the corresponding recovery behavior.

Detect failure-prone autonomous states
Correct with human intervention and rollback
Replay paired failure and correction samples
ParkingWorld end-to-end RL architecture

Why 3DGS

Less domain gap, richer interaction

Comparison of CARLA LGSVL, real world, and 3DGS simulator training
ParkingWorld uses reconstructed 3DGS scenes to bridge simulator training and real-world deployment.

Simulation Experiment

Strong closed-loop performance on standard slots

Training and testing are conducted on five reconstructed 3DGS parking scenes containing 239 standard parking slots. Each method is evaluated over 200 trials.

Type Method PSR (%) ↑ PCR (%) ↓ PTR (%) ↓ PBR (%) ↓ NGS ↓
Rule-basedRS Curve31.00.069.00.01.6
Rule-basedHybrid A*55.50.045.50.014.9
End-to-endParkingE2E34.046.50.019.52.6
End-to-endREAP-PPO46.039.015.03.026.7
End-to-endREAP-SAC68.522.09.50.017.5
End-to-endParkingWorld88.04.08.00.013.2
ParkingWorld trajectory visualization across different scenes

Real-World Experiment

Deployed on a Changan CS55

Four surround-view fisheye cameras stream images at 5 Hz to a dedicated computer, where ParkingWorld predicts target speed and steering commands. The commands are rolled out into a short-horizon kinematic trajectory and tracked by an onboard NUC using sampled NMPC at 50 Hz.

80.0%real PSR
5.0%near-collision rate
0.0%boundary crossing
Real vehicle deployment framework for ParkingWorld

Real-Vehicle Comparison

Robust transfer beyond simulation

Type Method PSR (%) ↑ PCR (%) ↓ PTR (%) ↓ PBR (%) ↓ NGS ↓
Rule-basedRS Curve15.00.085.00.06.5
Rule-basedHybrid A*35.00.065.00.029.7
End-to-endParkingE2E30.045.00.025.08.5
End-to-endREAP-PPO40.02.035.05.033.8
End-to-endREAP-SAC60.015.025.00.026.6
End-to-endParkingWorld80.05.015.00.022.1

Citation

BibTeX

@article{parkingworld2026,
  title   = {ParkingWorld: End-to-End Autonomous Parking Reinforcement Learning from Human Experience in 3DGS Simulation},
  author  = {Yu, Zhengcheng and Li, Changze and Liu, Haoran and Qin, Tong},
  journal = {IEEE Robotics and Automation Letters},
  year    = {2026}
}