filesampler: File Sampler

A collection of utilities for reading subsamples of flat text files by line in a reasonably efficient manner. We do so by sampling as the input file is scanned and randomly choosing whether or not to dump the current line to an external temporary file. This temporary file is then read back into R. For (aggressive) 'downsampling', this is a very effective strategy; for resampling, you are much better off reading the full dataset into memory.

Source: filesampler_0.4-0.tar.gz    Archive

Author Drew Schmidt [aut, cre], Daniel Lemire [ctb] (vectorized line counter)
Maintainer Drew Schmidt <wrathematics@gmail.com>
Version 0.4-0
License BSD 2-clause License + file LICENSE
Depends R (>= 3.5.0)
Imports utils
Compiled? yes
URL https://github.com/wrathematics/filesampler
BugReports https://github.com/wrathematics/filesampler/issues