Preprocessing

niaarm.preprocessing.squash(dataset, threshold, similarity='euclidean')

Squash dataset.

Parameters:
  • dataset (Dataset) – Dataset to squash.

  • threshold (float) – Similarity threshold. Should be between 0 and 1.

  • similarity (str) – Similarity measure for comparing transactions (euclidean or cosine). Default: ‘euclidean’.

Returns:

Squashed dataset.

Return type:

Dataset