---
name: "EntoLinguistics"
description: "Expertise in entomological scientific language, terminology networks, corpus methods, CACE lexical standards, and Active Inference–aligned discourse analysis."
tags: ["entomology", "scientific-communication", "terminology-networks", "corpus-linguistics", "active-inference", "cace", "semantic-entropy", "myrmecology"]
---

# Ento-Linguistics: Language, Ambiguity, and Scientific Communication in Entomology

**Daniel Ari Friedman** · **Tucker Cahill Chambers** (2026) · Entomology · discourse analysis

## Instructions

Use this skill when working with topics related to **entomological terminology, scientific communication, corpus-based term extraction, terminology networks, semantic ambiguity, or CACE (Clarity, Appropriateness, Consistency, Evolvability)**.

When applying this skill:

1. Ground claims in the six Ento-Linguistic domains and distinguish metaphor-laden from mechanistic descriptions where modeling is at stake.
1. Treat terminology reform as epistemic hygiene: align lexicon with generative models, reproducible pipelines, and explicit scoring (e.g., CACE, semantic entropy).

## Key Concepts

- **Ento-Linguistic domains** — Unit of Individuality; Power & Labor; Behavior & Identity; Sex & Reproduction; Kin & Relatedness; Economics.
- **Terminology networks** — co-occurrence structure, modularity, cross-domain bridging terms.
- **Semantic entropy** — ambiguity of term usage across contexts.
- **CACE** — Clarity, Appropriateness, Consistency, Evolvability as meta-standards for terms.
- **Active Inference** — language as hyper-prior; community model updating and misspecification.
- **Markov blanket** — formal boundaries relevant to individuality and scale in models.

## Methods & Techniques

- Literature mining and corpus construction (e.g., PubMed, arXiv) with deduplication and NLP preprocessing.
- Automated term extraction, domain seed expansion, n-gram and compound-term handling.
- Semantic clustering of contexts and entropy-based ambiguity flags.
- Statistical comparison across domains; network centrality and bridging-term analysis.
- Rhetorical and framing analysis; per-term CACE scoring.

## Key Findings

- A 369-publication corpus yields structured terminology statistics (e.g., 888 candidate terms, 261 domain-assigned) and network structure with cross-domain bridging, notably in Power & Labor.
- High-frequency social-metaphor terms (e.g., queen, worker, caste) co-occur with hierarchical framing that can misalign with stigmergic colony dynamics.
- A substantial share of domain-assigned terms show context-dependent semantic drift, complicating scale-consistent modeling.

## Prerequisites

- Social insect biology and basic myrmecology terminology.
- Introductory statistics and NLP concepts (tokenization, clustering, multiple testing).
- Active Inference or Bayesian modeling at a conceptual level.

## 🎯 Consulting & Tutoring

[Daniel Ari Friedman, PhD](https://danielarifriedman.com/) is available for AI Research Consulting and Tutoring related to this skill.

## Related Skills

See [BIBLIOGRAPHY.md](../../pages/BIBLIOGRAPHY.md) for the complete publication catalog and related papers.

**Code and data**: https://github.com/docxology/ento_linguistics
