datascience.tables.Table.sample¶

Table.sample(k=None, with_replacement=False, weights=None)[source]¶

Returns a new table where k rows are randomly sampled from the original table.

Kwargs:

k (int or None): If None (default), all the rows in the table are: sampled. If an integer, k rows from the original table are sampled.
with_replacement (bool): If False (default), samples the rows: without replacement. If True, samples the rows with replacement.
weights (list/array or None): If None (default), samples the rows: using a uniform random distribution. If a list/array is passed in, it must be the same length as the number of rows in the table and the values must sum to 1. The rows will then be sampled according the the probability distribution in weights.

Returns:

A new instance of Table.

>>> jobs = Table().with_columns([
...     'job',  ['a', 'b', 'c', 'd'],
...     'wage', [10, 20, 15, 8]])
>>> jobs
job  | wage
a    | 10
b    | 20
c    | 15
d    | 8
>>> jobs.sample() 
job  | wage
b    | 20
c    | 15
a    | 10
d    | 8
>>> jobs.sample(k = 2) 
job  | wage
b    | 20
c    | 15
>>> jobs.sample(k = 2, with_replacement = True,
...     weights = [0.5, 0.5, 0, 0]) 
job  | wage
a    | 10
a    | 10

datascience.tables.Table.sample¶

Previous topic

Next topic

This Page