KTH Logo Dataset

A toy dataset based on KTH's logotype.

  • Uniformly sampled from the logo
  • Standardized: $x' = \frac{x - \textrm{mean}(x)}{\textrm{std}(x)}$
  • Interface similar to scikit-learn's toy datasets.
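The first two points can be sketched with NumPy alone. This is a minimal illustration, not the package's actual implementation: `sample_from_mask` and `standardize` are hypothetical helpers, the square mask stands in for the logo image, and standardizing each coordinate separately is an assumption.

```python
import numpy as np


def sample_from_mask(mask, n_samples, random_state=None):
    """Draw points uniformly from the True pixels of a boolean mask (hypothetical helper)."""
    rng = np.random.default_rng(random_state)
    rows, cols = np.nonzero(mask)
    # Pick foreground pixels uniformly at random, with replacement.
    idx = rng.integers(0, rows.size, size=n_samples)
    # Jitter inside each pixel so the samples are continuous rather than on a grid.
    px = cols[idx] + rng.random(n_samples)
    py = rows[idx] + rng.random(n_samples)
    # Negate the row coordinate so the top of the image ends up at the top of a plot.
    return np.column_stack([px, -py])


def standardize(points):
    """Apply x' = (x - mean(x)) / std(x) per coordinate (assumed, not confirmed)."""
    return (points - points.mean(axis=0)) / points.std(axis=0)


# A filled square stands in for the logo here.
mask = np.zeros((20, 20), dtype=bool)
mask[5:15, 5:15] = True

samples = standardize(sample_from_mask(mask, n_samples=1000, random_state=0))
```

After standardization each column of `samples` has zero mean and unit standard deviation, whatever the pixel size of the source image.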

[Figure: animated scatter plot of samples drawn from the dataset]

Install

pip install git+https://github.com/klaswijk/kth-dataset

Usage

from kth_dataset import make_kth_dataset

x, y = make_kth_dataset(
    n_samples=1000,
    noise=0.0,  # Standard deviation of Gaussian noise
    path=".",
    download=True,  # Downloads kth_logo.png to path
    random_state=0
)

# x is an (n_samples, 2) array containing the samples
# y is all zeros
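
To illustrate what the `noise` and `random_state` parameters mean, here is a rough sketch of adding Gaussian noise with a given standard deviation. `add_gaussian_noise` is a hypothetical helper written for this example. How the package applies noise internally, and which random generator it uses, are assumptions.

```python
import numpy as np


def add_gaussian_noise(x, noise, random_state=None):
    """Add zero-mean Gaussian noise with standard deviation `noise` (hypothetical helper)."""
    rng = np.random.default_rng(random_state)
    return x + rng.normal(loc=0.0, scale=noise, size=x.shape)


# Placeholder points; in practice these would be the sampled coordinates.
clean = np.zeros((1000, 2))

noisy = add_gaussian_noise(clean, noise=0.1, random_state=0)
unchanged = add_gaussian_noise(clean, noise=0.0, random_state=0)
```

With `noise=0.0` the points are returned unchanged, and a fixed `random_state` makes the perturbation reproducible.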