Perpetua: Multi-Hypothesis Persistence Modeling for Semi-Static Environments

Pre-print

Miguel Saavedra-Ruiz
Mila - Quebec AI Institute
Université de Montréal
DIRO
Samer B. Nashed
Mila - Quebec AI Institute
Université de Montréal
DIRO
Charlie Gauthier
Mila - Quebec AI Institute
Université de Montréal
DIRO
Liam Paull
Mila - Quebec AI Institute
Université de Montréal
DIRO




Abstract. Many robotic systems require extended deployments in complex, dynamic environments. In such deployments, parts of the environment may change in between subsequent observations by the robot. Few robotic mapping or environment modeling algorithms are capable of representing dynamic features in a way that enables predicting their future state. Instead, most approaches opt to filter certain state observations, either by removing them or some form of weighted averaging. This paper introduces Perpetua, a method for modeling the dynamics of semi-static features. Perpetua is able to: incorporate prior knowledge about the dynamics of the feature if it exists, track multiple hypotheses, and adapt over time to enable predicting their future state. Specifically, we chain together mixtures of ``persistence'' and ``emergence'' filters to model the probability that features will disappear or reappear in a formal Bayesian framework. The approach is an efficient, scalable, general, and robust method for estimating the state of features in an environment, both in the present as well as at arbitrary future times. Through experiments on both simulated and real-world data, we find Perpetua yields better accuracy than similar approaches while also being online adaptable and robust to missing observations.

About

In this page we present an introduction to our persistence estimator Perpetua, as well as additional videos. For further information and a more detailed explanation, please refer to our paper!

Task

In this work, we focus on environments undergoing semi-static changes, where features may appear and disappear over time, but where these transitions are not necessarily observable. Our goal is to both model and predict the presence or absence of these features over time—a task known as persistence estimation—at any given time $t \in [0, \infty)$.

Method

We introduce Perpetua, a powerful and efficient method for estimating the persistence of semi-static features. Designed to handle dynamic environments, our method offers key advantages, including robustness to missing observations, online adaptation from data, accurate future persistence prediction, the ability to model multiple persistence hypotheses, and reduced dependence on prior knowledge.

Perpetua models persistence using a combination of two mixture models: one for disappearance, called the mixture of persistence filters, and one for reappearance, known as the mixture of emergence filters. This approach allows us to track multiple persistence hypotheses for a feature, capturing changes occurring at different time scales, such as hourly versus daily variations. These models are paired with a switching mechanism (state machine) that dynamically toggles between them, ensuring accurate tracking of feature persistence over time. Unlike previous filtering-based approaches, which often rely on fixed parameters or manual tuning, we introduce an expectation-maximization-based framework for learning model parameters directly from noisy observations.


The figure shows the mixture of emergence and persistence filters in the left panel, where each model consists of two mixture components. By combining the different components from both models, we generate up to four possible persistence estimates for the feature's state, as depicted in the middle panel. Finally, based on the available measurements, Perpetua selects the combination that achieves the highest posterior weight, enabling it to accurately model the persistence of a semi-static feature, as shown in the right panel. This panel also visualizes the dynamic switching between the persistence and emergence models.

We emphasize the significance of the mixture formulation in Perpetua, as it enables the model to capture multiple persistence hypotheses for a feature.



In the above video, the top panel shows Perpetua using a two-component mixture, while the bottom panel shows a single-component mixture. From $t_0$ to $t_1$, persistence estimates adapt to the dynamics based on the available noisy observations. Between $t_1$ and $t_2$, Perpetua relies on the component with the largest posterior weight to predict feature persistence in the absence of new observations. At $t_2$, when observations resume, Perpetua with two components adjusts its persistence estimate by switching to the mixture component that best explains the new feature dynamics, using it to make persistence predictions beyond $t_3$. In contrast, Perpetua with one component fails to make this adjustment.

Evaluation

We evaluate the predictive capabilities of our method and its robustness to data sparsity in both a simulated and a real-world environment, using two distinct datasets. Throughout the evaluation, we compare our method against four well-established baselines for persistence estimation in semi-static scenes.

Simulation. A robot navigates a room composed of eight landmarks where four are static and four are semi-static. All semi-static landmarks have different appearance and disappearance times, ranging from 1min to 20min, sampled from a mixture of log-normal distributions. Landmarks can have up to three distinct appearance and disappearance times. During data collection, the robot navigates counter-clockwise while during testing the robot moves clockwise.


Train

Test


Parking Lot Dataset. Dataset from Almeida et al. which consists of 12,427 images of parking lots captured under varying environmental conditions. These images were taken in the parking lots of the Federal University of Parana (UFPR), with observations recorded every five minutes for more than 30 days. Our evaluation focus on the UFPR04 and UFPR05 subsets, which represent different views of the same parking lot captured from the fourth and fifth floors of the UFPR building. To match real-world conditions, we augment the dataset using an off-the-shelf object detector, YOLOv11, to obtain observations.


UFPR04
UFPR05

Results

The following image shows prediction (a-c) and the data sparsity ablation (d-f) in the simulated room (a,d) and parking lot datasets, UFPR04: (b,e), UFPR05: (c,f). In this assessment we use as metric the balanced accuracy (B-Accuracy), which accounts for class imbalance by equally weighting true positives and true negatives. The top row shows the balanced accuracy of all methods when predicting feature persistence for a prediction time ($\Delta t$) in the future. The bottom row presents an ablation study on data sparsity, evaluating filter-based methods when only a random subset of the noisy test data is available.


The prediction results, shown in the top row of the figure, demonstrate the strong predictive capabilities of our method across different forecasting horizons, outperforming the baselines. Similarly, the bottom row illustrates that by using only 5 to 20% of the available observations, Perpetua consistently outperforms all other methods.

Lastly we show the persistence estimates of our method compared with the baselines. As seen, the persistence estimates from our method align with the ground truth state of a semi-static feature in the UFPR04 dataset.


Example results obtained by our method and baselines when estimating the persistence of a feature in the UFPR04 data set. Here, we threshold the persistence estimates of all methods at 0.5.

Citation

@article{saavedra2025perpetua,
	title        = {Perpetua: Multi-Hypothesis Persistence Modeling for Semi-Static Environments},
	author       = {Saavedra-Ruiz, Miguel and Nashed, Samer and Gauthier, Charlie and Paull, Liam},
	year         = 2025,
	journal      = {arXiv preprint},
}    

Department of Computer Science and Operations Research | Université de Montréal | Mila