Lab Data Analysis Toolkit

LabDAT is a high-level library for processing experiment data. It simplifies and automates many of the pre-processing stages while enforcing standardized best practices that allow for easy replication and extension of your pipeline. Built-in functions for common raw data structures provide plug-and-play adaptation for most projects, and the modular framework allows for easy expansion to fit any need.

NOTE: LabDAT is in early development. Expect major revisions to both the documentation and the codebase.

Getting Started

Configuration

LabDAT can be used as an imported library, or run as an automated pipeline driven by a config file (YAML, JSON, PICKLE).
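As a rough illustration of how a config file in any of the three formats could be read, here is a minimal sketch. The function name `load_config` and the extension mapping are assumptions made for this example and are not LabDAT's actual loader. YAML parsing assumes the third-party PyYAML package is installed.

```python
import json
import pickle
from pathlib import Path


def load_config(path):
    """Load a config mapping, choosing the parser from the file extension.

    Hypothetical helper for illustration; not part of LabDAT.
    """
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix in (".yml", ".yaml"):
        # PyYAML is third-party; import lazily so JSON/pickle work without it.
        import yaml
        with path.open("r", encoding="utf-8") as fh:
            return yaml.safe_load(fh)
    if suffix == ".json":
        with path.open("r", encoding="utf-8") as fh:
            return json.load(fh)
    if suffix in (".pkl", ".pickle"):
        # Only unpickle files you trust: pickle can execute arbitrary code.
        with path.open("rb") as fh:
            return pickle.load(fh)
    raise ValueError(f"Unsupported config format: {suffix!r}")
```

Whichever format is used, the result is an ordinary nested dictionary, so the rest of a pipeline does not need to know where the settings came from.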

Directories

LabDAT requires an input and an output directory. If an output directory is not defined, it is assumed to be the same as the input directory.

Additionally, you can provide locations for your custom configuration and script files. These will automatically be added to your LabDAT library for later use.

dir:
  INPUT: /content/example/data
  OUTPUT: /content/example/output
  CONFIG: /content/example/cfg
  SCRIPTS: /content/example/scripts
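The fallback rule for the output directory can be sketched in a few lines. `resolve_dirs` is a hypothetical helper written for this example; it only assumes the `dir` section has already been parsed into a dictionary with the keys shown above.

```python
def resolve_dirs(dir_cfg):
    """Return a copy of the `dir` mapping with OUTPUT defaulting to INPUT.

    Hypothetical helper for illustration; not part of LabDAT.
    """
    if not dir_cfg.get("INPUT"):
        raise KeyError("dir.INPUT is required")
    resolved = dict(dir_cfg)
    # Treat a missing or empty OUTPUT as "same as INPUT".
    if not resolved.get("OUTPUT"):
        resolved["OUTPUT"] = resolved["INPUT"]
    return resolved


dirs = resolve_dirs({"INPUT": "/content/example/data"})
print(dirs["OUTPUT"])  # /content/example/data
```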

Global Settings

Various global settings can also be defined. The most important are CUDA availability and the GPU index.

You can also define a scope for the data you'd like to process, such as limiting by experiment or subject names. Note that these names must match your defined hierarchy for a dataset, as discussed in DATASET.

settings:
  CUDA: yes
  GPU_INDEX: -1
  SCOPE:
    experiment: BerkeleyOutdoorWalk
    subject: [Subject08, Subject10]
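To make the scope semantics concrete, the sketch below filters a list of records against a `SCOPE` mapping in which each value is either a single name or a list of names. The function `in_scope` and the sample records are illustrative assumptions; how LabDAT applies the scope internally is not described here.

```python
def in_scope(record, scope):
    """Return True if the record matches every key in the scope.

    Each scope value may be a single name or a list of allowed names.
    Hypothetical helper for illustration; not part of LabDAT.
    """
    for key, allowed in scope.items():
        if not isinstance(allowed, (list, tuple, set)):
            allowed = [allowed]
        if record.get(key) not in allowed:
            return False
    return True


scope = {
    "experiment": "BerkeleyOutdoorWalk",
    "subject": ["Subject08", "Subject10"],
}

# Made-up records, used only to exercise the filter.
records = [
    {"experiment": "BerkeleyOutdoorWalk", "subject": "Subject08"},
    {"experiment": "BerkeleyOutdoorWalk", "subject": "Subject09"},
]

selected = [r for r in records if in_scope(r, scope)]
```

Only the first record is kept, because `Subject09` is not in the allowed subject list.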

Defining a Pipeline

A pipeline lists the functions you'd like to run on the data, organized into stages. The syntax mirrors how the functions are called within the library, and stages can be nested.

For example:

stage:
- Database:
    dir:
      data: raw
      database: database
      
    overwrite: true

    stage:
    - Initialize:
    - Upload:

This is equivalent to:

import labdat as ld

ld.Database.dir = {
  'data': 'raw',
  'database': 'database'
}
ld.Database.overwrite = True

ld.Database.Initialize()
ld.Database.Upload()
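The mapping from a nested `stage` list to attribute assignments and function calls can be sketched as a small recursive walker. This is an illustration under stated assumptions, not LabDAT's dispatcher: `run_stages` is a hypothetical name, entries with an empty body (such as `- Initialize:`) are treated as calls with no arguments, and a stand-in object replaces the real `labdat` module so the example runs on its own.

```python
from types import SimpleNamespace


def run_stages(parent, stages):
    """Walk a nested `stage` list, setting attributes and calling leaf functions.

    Hypothetical dispatcher for illustration; not part of LabDAT.
    """
    for entry in stages:
        for name, body in entry.items():
            target = getattr(parent, name)
            if body is None:
                target()  # leaf entry such as `- Initialize:`
                continue
            # Apply settings first, then descend into the nested stages.
            for key, value in body.items():
                if key != "stage":
                    setattr(target, key, value)
            run_stages(target, body.get("stage", []))


# Stand-in for `import labdat as ld`; records which functions were called.
calls = []
ld = SimpleNamespace(
    Database=SimpleNamespace(
        Initialize=lambda: calls.append("Initialize"),
        Upload=lambda: calls.append("Upload"),
    )
)

# The parsed form of the YAML example above.
config = {
    "stage": [
        {
            "Database": {
                "dir": {"data": "raw", "database": "database"},
                "overwrite": True,
                "stage": [{"Initialize": None}, {"Upload": None}],
            }
        }
    ]
}

run_stages(ld, config["stage"])
```

After the call, `ld.Database.dir` and `ld.Database.overwrite` hold the configured values and `calls` lists the two functions in the order the config named them.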

Database

For efficient storage and access, LabDAT pulls raw data from known filetypes and stores them in a centralized SQLite database.
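As a sketch of the general idea, the snippet below stores raw file contents in a single SQLite table using Python's standard `sqlite3` module. The table name `raw_files`, its columns, and the `store_files` helper are assumptions made for this example; LabDAT's actual schema is not documented in this section.

```python
import sqlite3


def store_files(conn, files):
    """Insert (path, content) pairs into a `raw_files` table.

    Hypothetical schema and helper for illustration; not part of LabDAT.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS raw_files ("
        "path TEXT PRIMARY KEY, content BLOB)"
    )
    # Re-uploading the same path replaces the earlier row.
    conn.executemany(
        "INSERT OR REPLACE INTO raw_files (path, content) VALUES (?, ?)",
        files,
    )
    conn.commit()


# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
store_files(conn, [("raw/a.csv", b"t,x\n0,1\n")])
row_count = conn.execute("SELECT COUNT(*) FROM raw_files").fetchone()[0]
```

Keeping everything in one database file means later stages can select data with SQL queries instead of re-parsing the raw files.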
