PLSA.data package¶

Submodules¶

Module for processing data

The function of this Module is served for processing data.

PLSA.data.processing.cut_groups(data, col, cutoffs)¶

Cut data into subsets according to cutoffs

Parameters:	data (pandas.DataFrame) – Data to split. col (str) – Name of column in data to compare with. cutoffs (list(int)) – List of cutoffs, like as [min-value, 30, 60, max-value].
Returns:	List of sub-data as DataFrame.
Return type:	list(pandas.DataFrame)

Examples

>>> cut_groups(data, "X", [0, 0.4, 0.6, 1.0])
[pandas.DataFrame, pandas.DataFrame, pandas.DataFrame]

PLSA.data.processing.parse_surv(x, label)¶

Parse raw-data for survival analyze(Deep Surival).

Parameters:	x (np.array) – two-dimension array indicating variables. label (dict) – Contain ‘e’, ‘t’. examples as {‘e’: np.array, ‘t’: np.array}.
Returns:	Sorted (x, e, t) tuple, index of people who is failure or at risk, and type of ties.
Return type:	tuple

Examples

>>> parse_surv(data[x_cols].values, {'e': data['e'].values, 't': data['t'].values})

PLSA.data.processing.prepare_surv(x, label)¶

Prepare data for survival analyze(Deep Surival).

Parameters:	x (numpy.array) – Two-dimension array indicating variables. label (dict) – Contain ‘e’, ‘t’. examples as {‘e’: np.array, ‘t’: np.array}.
Returns:	Sorted (x, label) tuple of survival data.
Return type:	tuple

Examples

>>> prepare_surv(data[x_cols].values, {'e': data['e'].values, 't': data['t'].values})