cat2cat.datasets

Functions

load_verticals(→ pandas.DataFrame)

load trans dataset

load_trans(→ pandas.DataFrame)

load trans dataset

load_occup(→ pandas.DataFrame)

load occup dataset

load_occup_panel(→ pandas.DataFrame)

load occup_panel dataset

Module Contents

cat2cat.datasets.load_verticals() pandas.DataFrame

load trans dataset trans dataset containing mappings (transitions) between old (2008) and new (2010) occupational codes

Returns:

trans dataset

Return type:

pandas.DataFrame

cat2cat.datasets.load_trans() pandas.DataFrame

load trans dataset trans dataset containing mappings (transitions) between old (2008) and new (2010) occupational codes

Returns:

trans dataset

Return type:

pandas.DataFrame

cat2cat.datasets.load_occup(small: bool = False) pandas.DataFrame

load occup dataset

occup dataset is an example of unbalance panel dataset. This is a simulated data although there are applied a real world characteristics from national statistical office survey. The original survey is anonymous and take place every two years. It is presenting a characteristics from randomly selected company and then using k step procedure employees are chosen.

Parameters:

small (bool) – if to use a shrinked version of dataset

Returns:

occup dataset

Return type:

pandas.DataFrame

cat2cat.datasets.load_occup_panel() pandas.DataFrame

load occup_panel dataset

occup_panel is an occupational panel-style example dataset from the R package, useful for validating weighted/probabilistic workflows.

Returns:

occup_panel dataset

Return type:

pandas.DataFrame