@jadesrochers/filereworker

Get information on data files, split and preserve headings

Usage no npm install needed!

<script type="module">
  import jadesrochersFilereworker from 'https://cdn.skypack.dev/@jadesrochers/filereworker';
</script>

README

File Reworker

Has a variety of helper functions for working with files to get file stats
like path,stats, name. The main functions though are for splitting files by size or by lines
into smaller files.

Whats the Use?

The file helpers are just little shortcuts, but the splitting functions
are used for making very large files into smaller files without breaking
lines in data such as text,csv.

installation

npm install @jadesrochers/filereworker

const filere = require('@jadesrochers/checksettings')

Usage

There are several ways to split files; by size, number, and cores,
with each option having size and lines sub options.

Manually split by size or lines

Split a file into pieces specified by size in Mb.

fsplit.splitFileSize('/fake/filename.txt')(10)

Split a file into pieces specified by number of lines.

fsplit.splitFileLines(['/path/to/fakefile.txt',2500])

For both these functions, if the size/lines does not result in evenly
dividing the file, the last file will be the remainder so the others are equal.

Automatic sizing of the splits

For these functions you don't need to provide a size or lines argument, just an indication of which size metric to use.

Split into even size/line piece specifying just the number

Specify the number of files, and based on either lines or size will try and make that exact number as even as possible.

fsplit.splitEvenSize('/not/a/real/file.csv')
fsplit.splitEvenLines('/not/a/real/file.csv')

Split by number of cores

Splits files into pieces based on the number of cores you have minus 1.
There is a size(S) and lines(L) version based on which will more accurately
split up the processing workload for the resulting files.

fsplit.splitByCoresS('/not/a/real/file.csv')
fsplit.splitByCoresL('/not/a/real/file.csv')

Helper functions

Some function to do basic path and sizing tasks.

Get file name:

fh.getFileName('/path/to/filename.txt')  
>> filename.txt 

Get a file path:

filere.getPath('/test/name/file.js')  
>> /test/name

Get file name without extension:

fh.nameAndBody('/fake/path/to/fakefile.txt')  
>> fakefile

Get file size in Mb:

fh.getFileSizeMb('/not/a/real/file.csv')

Get number of lines in file:

fh.getFileLines('/path/to/fakefile.txt')