Back to top
Eurostat logo
National reference metadata Information message

Luxembourg

Reference metadata describe statistical concepts and methodologies used for the collection and generation of data. They provide information on data quality and, since they are strongly content-oriented, assist users in interpreting the data. Reference metadata, unlike structural metadata, can be decoupled from the data.

For more information, please consult our metadata website section.

Close
Graphic logo

Structural business statistics (sbs)

DownloadPrint

National Reference Metadata in SBS Euro-SDMX Metadata Structure (ESMS) - from reference year 2021 onwards (ESSBS21)

Compiling agency: STATEC

Need help? Contact the Eurostat user support


 
Short metadata
Full metadata

Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors). SBS covers all activities of the business economy with the exception of agricultural activities, public administration and (largely) non-market services such as education and health. Main characteristics (variables) of the SBS data category:

  • "Business demographic" variables (e.g. Number of active enterprises)
  • "Output related" variables (e.g. Net turnover, Value added)
  • "Input related" variables: labour input (e.g. Number of employees and self-employed persons, Hours worked by employees); goods and services input (e.g. Purchases of goods and services); capital input (e.g. Gross investments)

Business services statistics (BS) collection contains harmonised statistics on business services. From 2008 onwards BS become part of the regular mandatory annual data collection of SBS. The BS’s data requirement includes variable “Turnover” broken down by products and by type of residence of client. 

The annual regional statistics collection includes three characteristics due by NUTS-2 country region and detailed on NACE Rev. 2 division level (2-digits).

19 August 2025

SBS constitutes an important and integrated part of the new European Business Statistics Regulation N° 2152/2019

Data requirements, simplifications and technical definitions are defined in Commission Implementing Regulation (EU) 2020/1197

Enterprise (national level)

Local unit (regional level)

Only active market units are included.

Active units encompass the following units:

  • units with a non-zero value of turnover, non-zero value of operating costs or non-zero employees;
  • units which are active according to the SBS survey and not properly recorded in the administrative sources (e.g. VAT exempt units without employment and not having filed their detailed financial statements yet);

The market status of a unit is defined in terms of the national accounts institutional sectors. Sectors 12.7, 13 and 15 are excluded from the target population. The following units are specifically excluded:

  • temporary partnerships;
  • private persons producing energy using photovoltaic installation;
  • artificial subsidiaries and other special purpose entities dealing almost exclusively with group affiliates.

There are no exclusions in terms of NACE. However, branch H.50 (water transport) is only available as a branch estimate and not as micro-data. Consequently, the size class dimension is not reliable for this activity branch.

Luxembourg economic territory.

2023

The reference year corresponds with the calendar year.

For each company with a different financial year than the calendar year, we take a decision regarding the SBS reference period. Normally, this decision is taken in such a way that at least 6 months of the financial year for a given company are included in the corresponding SBS reference period. Once that decision is taken, it is maintained every subsequent year for comparability reasons.

Sampling error is mainly relevant for small size classes but also more generally for certain characteristics (e.g. unpaid persons employed). This error has most often a local impact. Since 2015, the sampling error has been reduced due to the integration of administrative source on accounting data.

Coverage errors are relevant for activities which carry little to no employment and which are not or only partially covered by VAT data. Examples of such activities are real estate activities, parts of the financial sector and most of the health professions. The failure to identify special purpose entities early on is another potential source of coverage error. Overall, the impact of coverage errors is local but can be significant.

Unit non-response error is generally low due to imputation strategies. For certain activities (e.g. construction, real estate, health, financial), imputation is nonetheless more challenging and can thus generate locally significant misstatements.

Measurement errors pertain mainly but not only to errors in the NACE classification and can be of significant impact locally as well as globally. Errors due to the misinterpretation of or incomplete information on globalisation phenomena are rare but can have significant global impacts due to the bigger figures of multinational enterprises.

Data processing errors are rare but may have a significant impact when they go undetected. The aforementioned data processing and analysis procedures are normally sufficient but in the past a few rare cases of such errors slipped into published data.

Preliminary results for SBS are mostly burdened by undercoverage errors (insufficient VAT and accounting data available for units with no employment), data processing errors (due to imputation) and measurement errors (mainly due to NACE misclassifications). Quality of the variables Number of persons employed and Number of enterprises tend to be understated as a consequence in preliminary SBS.

  • Number of enterprises and number of local units are expressed in units.
  • Monetary data are expressed in millions of €.
  • Employment variables are expressed in units.
  • Per head values are expressed in thousands of € per head. 

Ratios are expressed in percentages.

Unit non-response imputation

Please refer to section 13.

Item non-response imputation

Given that SBS data are above all quantitative and that they also serve the needs of national accounts, it is very difficult to identify item non-response. This is in particular true for units who are included only once in the sample in a given time series. For the bigger companies, it is possible to follow the reporting structure over a larger time period and thus to identify item non-response in an easier way. Item non-response is therefore dealt with on a case-by-case basis during editing procedures.

Missing or erroneous information in the administrative source

Administrative data can be systematically missing or punctually missing.

The risk of systematically missing administrative data is significantly higher in a few very specific financial activities (subject to prudential reporting but insufficiently detailed) and for health professions (VAT not applicable, no obligation to file any financial statements, no access to revenue tax data). In such cases, it is often not possible to fill the gaps with survey data because the survey requires the units to be active in terms of administrative source indicators (e.g. VAT, employment, accounting records, etc.).

In case of punctually missing or erroneous administrative data, adjustments are performed in a dedicated working area. For the administrative accounting data source (available as of 2015), any missing data is imputed using either cold-deck ratio imputation or hot-deck ratio imputation (grossing up).

Grossing up

When grossing up survey data, we always take into account ancillary data available in the administrative sources, i.e. the ancillary variables “turnover” and “number of employees”, and not only the sample design. The aforementioned ancillary variables most often have a linear correlation with the variables of interest in SBS. Consequently, the ratio estimator is used for grossing up.

With the variable of interest being available only for the sample (y) and with the ancillary variable being available both for the sample (x) and the SBS target population (X), the ratio estimator of the variable of interest , expressed as an average, can be written:

 2024-09-16_105647

The ratio R has been formulated in the following way:

 2024-09-16_105727

If for a given stratum no unit is available in the sample, i.e. neither for the historical periods nor for the reference period, the units in this stratum are grouped together with another stratum for which sample data are available and are then grossed up.

Variables "Number of enterprises", "Personnel costs", "Wages", "Social security charges", "Number of employees", "Number of full-time equivalent employees" and "Number of hours worked by employees" are directly derived from administrative source data and therefore do not need to be grossed up (as from reference year 2009). If prior year survey data are available, a cold-deck ratio imputation is performed – this significantly reduces the bias in case there are permanent differences between the admin source and the survey data for a given unit. For the other units, administrative data are simply pasted into the data set.

Please also refer to section 13.

Statistical survey combined with administrative sources

The annual structural business survey used to be at the heart of the data collection. While this is still true today for medium-size and large firms, for smaller entities, the administrative accounting sources (mainly Central Balance sheet Office) have progressively taken over since reference year 2015.

Even with the partial shift from survey to administrative sources, the data collection still requires a sample which is based on a stratified random sampling design. With the extension of the activity coverage as of reference year 2021, the sampling strategy was revised. Using strata based on the number of employees and turnover applied on the population frame, a random stratified sample is drawn and then divided into two subsamples:

* an exhaustive survey subsample: any legal units employing either more than 50 employees or having declared a turnover excluding VAT of at least 12 million EUR per annum are selected every year. For certain activities in the financial sector, the thresholds are increased for survey cost reasons. Any legal units linked to the aforementioned units (e.g. group or enterprise links) are also selected to form part of the survey every year. To cover the needs of national accounts and SBS, the survey sample size is roughly equal 3000 reporting units.

* a random administrative accounting source subsample: any legal units having both less than 50 employees and a turnover of less than 12 million EUR are drawn randomly by stratum and exhaustively when the stratum only consists of singletons. Instead of being surveyed, these units are later on imputed using detailed administrative accounting data available for the reference year. The sample design allows for the possibility to use these units for handling non-response in the administrative accounting source, that is units which have not yet reported their accounting data at the moment of statistical production. The sample size is roughly equal to 7000 reporting units.

Administrative accounting data may be available for any firm in the population but outside of the aforementioned sample. When such data are available, these units, which are not part of the sample and which can therefore only represent themselves in the total population, are removed from the subpopulation to gross up, thus contributing to the reduction of the sampling variance due to grossing up.

Credit establishments and insurance companies are measured entirely through administrative sources (prudential reporting) and only abide by the guidelines here above for investment variables, which are entirely measured by survey for this specific sector.

The stratification used in the sample design does not take into account the size classes defined in the transmission format of the EBS SBS regulation. However, mass imputation procedures exist on a micro-data level, so that data can be broken down according to any spanning variable defined a posteriori, albeit with an increased risk of bias:

* since 2015, detailed accounting data from the Central Balance Sheet Office have been used to impute units which are not part of the sample or which no recent survey data are available (with a sample weight of 1) - this source covers balance sheet and profit & loss. In recent years, the data source accounts for more than half of the SBS population;

* since 2005, social security data have been used to impute the number of employees, personnel costs and number of hours worked by employees;

* detailed VAT data are used to impute investment variables, unless the latter can be derived from the administrative accounting data (since reference year 2021 revised);

* VAT and social security key indicators are used to impute non-respondent units using a cold-deck ratio imputation method.

Data sources used for the population frame or target population

The statistical business register is the primary source for building the target population. The said register is enriched by key data from social security and VAT, and since 2019 from the administrative accounting data. These data are used to determine the activity status of any enterprise. Only active units are retained. There is no threshold on employment and turnover, except that either has to be positive for the reference year in question.

In addition, for units having no employment and not covered by either VAT or accounting data source, the survey helps to identify a few active units. They are added to the population frame.

NACE classification

The difference between principal and secondary activities is directly available in the business register. The approach to identify principal activities is the top-down approach. Stability rules are in place to avoid yearly shifting between principal and secondary activities.

The update frequency of the business register is daily. However, for SBS purposes the extract of the business register is frozen at a given date. From that moment on, changes in NACE are only performed on an ad hoc basis and through a dedicated working area.

Relationship between the reporting unit and the enterprise

In the SBS survey, the reporting unit is the legal unit. However, for a very few legal units, the data are collected separately for the economic subdivisions of the legal unit (kind-of-activity units). Moreover, local unit data are collected at legal unit level for units being part of the SBS survey. In the administrative sources, the reporting unit is the legal unit.

Finally, the statistical unit "enterprise" can be a legal unit or a combination of legal units. Enterprises which are formed by a group of legal units are not the general case in Luxembourg but their occurrence is still significant, in particular due to many legal units specialized in ancillary activities for other legal units belonging to the same enterprise group.

Annual.

Time between end of reference period and national data dissemination: t+22 to t+24 months

Not applicable.

Length of comparable time series

1995 - 2002 (KAU concept)

2003 - 2009 (enterprise concept NACE Rev.1.1)

2005 - 2023 (enterprise concept NACE Rev.2)

Important events in the time series

revision of SBS data series was completed in summer 2014 for the reference years 2005 to 2010 included. The revision was performed using NACE Rev.2 and included the revision of the profiling of some major players. The revision was significant due to the adaptation of the enterprise definition for some multinational enterprises and in general due to changes in NACE codes for enterprises.

As of 2012, the identification of the legal unit providing the activity support to the enterprise to which it belongs is allocated according to the contribution in terms of value-added of each legal unit composing the enterprise, in case the support unit allocated in the Business Register is too recent. The impact of this change is local and not global. For the same reference year, punctual, local and significant breaks in series are flagged accordingly in the tables. The reasons for the breaks are typically significant errors in the NACE codes for enterprises as well as significant measurement errors detected in hindsight.

As of 2014, the technical definition of the target population in terms of enterprises and KAUs has changed: KAUs without any activity support unit in the SBS target population as defined according to the enterprise concept are no longer part of the said population. The impact of this change in method had no significant impact but was important to ensure consistency with the financial sector.

As of 2015, Central Balance Sheet Office data are used to impute units which are not part of the sample or which no recent survey data are available. The impact of the data integration is an overall improvement of the sampling error, potentially traded for an increased measurement error at a local level. This integration lead to several local breaks in time series.

As of 2019 reference year, a rolling revision policy has been introduced. After the official production of reference year T (e.g. 2019) published during the year T+2 (e.g. 2021), a revision for the reference year T data is performed during T+3 (e.g. 2022). The impact of this revision not only removes local breaks in series due to temporary errors in NACE but also introduces a less desirable side-effect: due to the consideration of late VAT data during the revision, there is a potentially significant upward impact on the number of smaller active enterprises and the number of persons employed (excluding employees).

As of 2021 reference year, the SBS have been extended to NACE sections P, Q, R and K as well as NACE division S96, in conformity with the EBS regulation. These more recently added activity branches present certain challenges, most of which will be ironed out in the years to come. Furthermore, several alignments with business demography were performed.

As of 2021 revised, data from the Central Balance Sheet Office are used to impute the investment variables when they are not already covered by the SBS survey. This has lead to an overall break in series, even though the impact can be mainly observed in a few activity branches pertaining to construction, real estate and rental services.