🔎 Project FINDIT

Facilitating Intra-Departmental Navigation of Data and Information Transfer

📌 Challenge Summary

Project FINDIT is a data management initiative for the Department of Psychology and Biobehavioral Science (PBS).
The project develops tools called "finders" (or crawlers) that automatically scan directories, extract metadata, and perform data management or preprocessing tasks.

🎯 Goals

Build basic finders for:
- Data inventory (what data we have and where it is stored)
- Metadata extraction
Extend to advanced finders for:
- Automated preprocessing of data
- Domain-specific tasks (e.g., detecting EEG data and performing automated artifact correction)

🏗️ Project History

Last Year’s Hackathon:
- Built an inventory finder that profiles data stored in a given directory.
- Built a basic participant-first finder for reporting available data based on an ID
  - Input: Subject ID
  - Output: A report showing all available data for that participant (e.g., imaging, clinical, sleep tracking, questionnaires).
  - Goal: Make it easy for researchers to quickly determine what data is available for any given research participant.
This Year’s Hackathon:
- Focus on optimizing participant-first finder
- Improving technical documentation

🚀 Usage

📂 file_crawler

file_crawler is an R project that recursively scans directories to create an inventory of files.
It captures file metadata (paths, names, sizes, modified times, and types) and, for SAS datasets (.sas7bdat), calculates the number of unique patient identifiers (MRNs).

📂 participant_first_findr

An R-based workflow for crawling directories based on patient MRNs across multiple file types (CSV, Excel, SAS).
Optimized for large datasets with parallel processing.

🚀 Future Directions

Improve automation for large-scale data workflows
Expand finder functionality to support additional data types
Integrate into department-wide data management pipelines

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Archive		Archive
File Crawler		File Crawler
Participant First Finder		Participant First Finder
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔎 Project FINDIT

📌 Challenge Summary

🎯 Goals

🏗️ Project History

🚀 Usage

📂 file_crawler

📂 participant_first_findr

🚀 Future Directions

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔎 Project FINDIT

📌 Challenge Summary

🎯 Goals

🏗️ Project History

🚀 Usage

📂 file_crawler

📂 participant_first_findr

🚀 Future Directions

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages