Skip to content
View sothiss's full-sized avatar

Highlights

  • Pro

Block or report sothiss

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
sothiss/README.md

Bio: Open Science & Computational Ethology

I am an independent researcher bridging data engineering with an academic background in law. My work focuses on building the open-science, reproducible data infrastructure that applied canine ethology desperately requires to overcome its current replication crisis. By shifting the paradigm from fragmented, subjective narratives toward deterministic data pipelines, my objective is to replace dogmatic assertions with traceably verified, physical realities.

Core Philosophy: Data Over Dogma True scientific rigor requires building scalable infrastructure to test behavioral claims objectively, moving the industry away from institutional appeals to authority and toward transparent code.

🔬 Active Research & Engineering Projects

🛠️ EthoPipe

An open-science, Python-based Extract, Transform, Load (ETL) pipeline engineered to serve as a strict digital gatekeeper for ethological data. It transforms unstructured, messy observations into machine-readable datasets viable for quantitative peer review.

  • Structural Reliability Matrix: Utilizes a validation engine built on Python and Pydantic to instantly reject malformed, subjective inputs (e.g., rejecting qualitative tags like "angry" or clamping physiological metrics to valid veterinary baselines like 30–250 BPM).
  • Deterministic Semantic Parsing: Integrates Google Gemini 2.5 Pro as a strict text parser. By clamping the model's temperature parameter to 0.0 and enforcing Structured JSON Outputs, it extracts data matching peer-reviewed operational definitions without the risk of LLM hallucinations.
  • Global Interoperability: Rigorously maps internal variables to Darwin Core (DwC) metadata standards (e.g., dwc:individualID, dwc:measurementType, and dwc:basisOfRecord as 'HumanObservation') to enable seamless integration with international informatics platforms like GBIF and OBIS.

📊 The Transparency Project

A global analytics dashboard engineered to track data regarding aversive training legislation and behavioral euthanasia outcomes across multiple international jurisdictions.

  • Architecture: Deployed via a Hybrid Framework combining a polished presentation layer on GitHub Pages with a live Python server hosted on Streamlit Community Cloud, integrated with a Firebase backend.
  • Legislative Baselines: Audits and visualizes legal frameworks, including localized benchmarks such as São Paulo Municipal Law No. 18.314/2025, which regulates punitive handling tools.

🛠️ Technical Proficiencies & Toolkit

  • Languages & Core Development: Python, Structured JSON, HTML5/CSS3
  • Data Engineering & Validation: Pydantic, ETL Pipeline Design
  • AI & Cloud Infrastructure: Google Gemini 2.5 Pro API (Deterministic Optimization), Firebase, Streamlit Community Cloud, GitHub Pages
  • Metadata & Informatics Standards: Darwin Core (DwC) Schema Mapping

🪪 Academic Identity & Verification

To maintain complete transparency and integration within the peer-reviewed ecosystem, my contributions are traceably linked across platforms:

  • ORCID iD: 0000-0002-1825-0097 (Example format—replace with your actual iD)
  • Digital Object Identifiers (DOIs): Automated release pipeline linking GitHub tags to Zenodo to push verified open-source software data straight to my official scientific record.
  • Manuscript Compliance: Embedded YAML metadata schemas matching JOSS (Journal of Open Source Software) automated compilation criteria.

📬 Connect & Collaborate

If you are a researcher, data scientist, or epidemiologist interested in expanding reproducible, open-source frameworks for animal welfare, feel free to open an issue or reach out:

  • Global Dashboard: thetransparencyproject.me
  • Local Case Study Blog: rotapetplanaltopaulista.com (Specialized Portuguese landing page for São Paulo legislative tracking)

-->

Pinned Loading

  1. ethopipe ethopipe Public

    Python 2