Dataset Registry
Search, filter, inspect proof, and compose fine-tune packs from local registry metadata and NAS-backed assets.
verifiedVerified publicNAS proofcc-by-4.0public
CRE Underwriting Royal Jelly v1Commercial real estate underwriting corpus for rent-roll normalization, NOI reasoning, diligence extraction, and acquisition memo structure.
Commercial Real EstateUnderwritingjsonlcsvinstruction-tuningextractionsummarization
verifiedVerified publicNAS proofcc-by-4.0public
Compute GPU Market Comps v1GPU and compute market comparison corpus for infrastructure pricing, utilization notes, hardware valuation, and resale reasoning.
Compute ValuationMarket Compsjsonlparquetrankingextractionqa
verifiedMembers indexedNAS proofdefendable-community-researchmembers
Medical Billing Appeals v1Medical administrative and appeal-reasoning corpus for denial classification, billing workflow extraction, and appeal drafting research.
Medical BillingAppealsjsonlmarkdownclassificationextractionsummarization
verifiedVerified publicNAS proofcc-by-4.0public
FCRA Credit Dispute Letters v1FCRA dispute letter and credit repair workflow corpus for issue taxonomy, bureau response parsing, and editable correspondence generation.
Credit RepairDispute Lettersjsonlmarkdowninstruction-tuningclassificationextraction
verifiedVerified publicNAS proofcc-by-4.0public
SAM.gov Opportunity Intelligence v1Government opportunity and grants intelligence corpus for qualification, capture planning, summarization, and ranking workflows.
Government ContractingOpportunity Intelligencejsonlcsvsummarizationrankingentity-resolution
verifiedVerified publicNAS proofmitpublic
Legal Letter Templates v1Legal and professional correspondence corpus for notices, demand letters, dispute responses, and structured template generation.
Legal LettersTemplatesmarkdownjsonlinstruction-tuningsummarization
verifiedVerified publicNAS proofapache-2.0public
Energy Site Assessment v1Energy site assessment corpus for infrastructure screening, interconnect diligence, constraint extraction, and feasibility scoring.
EnergySite Assessmentjsonlyamlrankingqaextraction
verifiedMembers indexedNAS proofdefendable-community-researchmembers
Local Service Leads Jupiter v1Local service and market lead intelligence corpus for enrichment, categorization, outreach readiness, and source confidence workflows.
Local ServicesLead Intelligencecsvjsonlclassificationentity-resolutionranking
verifiedVerified publicNAS proofapache-2.0public
Mining Ops Q&A v1Mining and blockchain operations corpus covering uptime, power, thermal constraints, maintenance workflows, and operations Q&A.
Mining / CryptoOperations Q&Ajsonlqainstruction-tuning
verifiedVerified publicNAS proofcc0-1.0public
General Agent Instruction Pairs v1Agent instruction corpus for tool use, task decomposition, refusal boundaries, instruction following, and concise operational responses.
General Instruction TuningAgent Instructionsjsonlparquetinstruction-tuningqasummarization
verifiedMembers indexedNAS proofdefendable-community-researchmembers
Minechain Master Inventory v1Minechain master medical instruction corpus with gold and platinum JSONL inventories, record counts, and SHA256 receipts.
Clinical MedicineMedical Knowledge SFTjsonlinstruction-tuningqasummarization