The hardware and bandwidth for this mirror is donated by dogado GmbH, the Webhosting and Full Service-Cloud Provider. Check out our Wordpress Tutorial.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]dogado.de.

Getting Started with Price Index Calculation using REPS

Introduction

The calculate_price_index() function is the central entry point in REPS for computing price indices using various hedonic-based methods. It supports six commonly used approaches:

This vignette demonstrates how to apply each method using a consistent interface, making it easy to compare results across approaches.

The HMTS method implemented in REPS is a multilateral, time-series-based index that balances stability, limited revision, and early detection of turning points in the context of property price indices (Ishaak, Ouwehand, and Remøy 2024).

For broader context and international guidelines on the compilation of property price indices, including traditional methods such as hedonic double imputation Laspeyres, Paasche, Fisher and Repricing, we refer to Eurostat’s Handbook on Residential Property Price Indices (RPPIs) (Eurostat 2013). For (Rolling) Time Dummy we refer to Hill et al. (Hill et al. 2018, 2022).

Required Data

Before running any calculations, ensure that your dataset is available and contains the necessary variables:

# Example dataset (you should already have this loaded)
head(data_constraxion)
#>   period   price floor_area dist_trainstation neighbourhood_code
#> 1 2008Q1 1142226  127.41917       2.887992985                  E
#> 2 2008Q1  667664   88.70604       2.903955192                  D
#> 3 2008Q1  636207  107.26257       8.250659447                  B
#> 4 2008Q1  777841  112.65725       0.005760792                  E
#> 5 2008Q1  795527  108.08537       1.842145127                  E
#> 6 2008Q1  539206   97.87751       6.375981360                  D
#>   dummy_large_city
#> 1                0
#> 2                1
#> 3                1
#> 4                0
#> 5                0
#> 6                1

The required variables include:

Typically, for some numerical variables you may want to apply a log transformation. For example, floor_area is often log-transformed to improve linearity, stabilize variance, and reduce the impact of extreme values. Log-transforming variables can help meet regression assumptions by making relationships between variables more linear and residuals more homoscedastic (constant variance).

Example of log-transforming floor_area:

dataset <- data_constraxion
dataset$floor_area <- log(dataset$floor_area)

Using calculate_price_index()

The calculate_price_index() function provides a unified interface for estimating hedonic price indices. You only need to specify the method via the method argument — the function handles the rest.

Supported methods:

Example: Single Index Method - Time Dummy

Tbl_TD <- calculate_price_index(
  dataset = dataset,
  method = "timedummy",
  period_variable = "period",
  dependent_variable = "price",
  numerical_variables = c("floor_area", "dist_trainstation"),
  categorical_variables = c("dummy_large_city", "neighbourhood_code"),
  reference_period = 2015,
  number_of_observations = FALSE
)

head(Tbl_TD)
#>   period    Index
#> 1 2008Q1 99.31408
#> 2 2008Q2 97.74089
#> 3 2008Q3 98.72017
#> 4 2008Q4 98.85271
#> 5 2009Q1 98.26102
#> 6 2009Q2 98.11607

Example: Multiple Index Methods - Fisher, Paasche and Laspeyres

multi_result <- calculate_price_index(
  dataset = dataset,
  method = c("fisher", "paasche", "laspeyres"),
  period_variable = "period",
  dependent_variable = "price",
  numerical_variables = c("floor_area", "dist_trainstation"),
  categorical_variables = c("dummy_large_city", "neighbourhood_code"),
  reference_period = 2015,
  number_of_observations = FALSE
)

head(multi_result$fisher)
#>   period    Index
#> 1 2008Q1 99.61436
#> 2 2008Q2 98.33742
#> 3 2008Q3 98.98908
#> 4 2008Q4 98.02118
#> 5 2009Q1 98.49971
#> 6 2009Q2 98.26416
head(multi_result$paasche)
#>   period    Index
#> 1 2008Q1 99.84034
#> 2 2008Q2 98.44649
#> 3 2008Q3 98.88248
#> 4 2008Q4 98.23097
#> 5 2009Q1 98.55413
#> 6 2009Q2 98.17509
head(multi_result$laspeyres)
#>   period    Index
#> 1 2008Q1 99.38877
#> 2 2008Q2 98.22836
#> 3 2008Q3 99.09567
#> 4 2008Q4 97.81174
#> 5 2009Q1 98.44521
#> 6 2009Q2 98.35320

Visualizing the Index

For quick and clear visualizations, the plot_price_index() utility function can be used to generate time-series plots of the calculated indices.

While we encourage users to create custom visualizations suited to their analytical needs, this built-in plotting function provides a convenient starting point for simple and consistent line plots.

plot_price_index(multi_result)

Summary

The calculate_price_index() function streamlines access to multiple hedonic index methods via a consistent interface. This allows analysts to easily compare outputs and select the most appropriate method for their context.

References

Eurostat. 2013. Handbook on Residential Property Price Indices (RPPIs). Publications Office of the European Union. https://doi.org/10.2785/34007.
Hill, Robert J., Michael Scholz, Chihiro Shimizu, and Michael Steurer. 2018. “An Evaluation of the Methods Used by European Countries to Compute Their Official House Price Indices.” Economie Et Statistique 2018 (500–502): 221–38. https://doi.org/10.24187/ECOSTAT.2018.500T.1953.
———. 2022. “Rolling-Time-Dummy House Price Indexes: Window Length, Linking and Options for Dealing with Low Transaction Volume.” Journal of Official Statistics 38 (1): 127–51. https://doi.org/10.2478/JOS-2022-0007.
Ishaak, F. F., Pim Ouwehand, and H. T. Remøy. 2024. “Constructing Limited-Revisable and Stable CPPIs for Small Domains.” Journal of Official Statistics 40 (3): 380–408. https://doi.org/10.1177/0282423X241246617.

These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.
Health stats visible at Monitor.