Entomology · Paper · 2026

Ento-Linguistics: Language, Ambiguity, and Scientific Communication in Entomology

Zenodo

Catalog Row109
Citation KeyFriedman2026EntoLinguisticsLanguageAmbiguity109
Paper FolderAvailable

Overview

Extracted from the local paper documentation when available.

Scientific language does not merely describe biological phenomena; it actively constitutes the generative models through which researchers parse complex systems. This paper makes three core contributions to understanding—and correcting—the epistemic consequences of this constitutive role. First, we introduce a six-domain Ento-Linguistic framework that decomposes the terminological landscape of insect research into analytically tractable themes, isolating domains where anthropomorphic language most severely distorts causal modeling. Second, we develop an open-source computational pipeline that integrates automated term extraction, co-occurrence network construction, and information-theoretic ambiguity scoring with principles from Active Inference and Complex Systems Theory. Third, we propose and validate four evidence-based meta-standards—Clarity, Appropriateness, Consistency, and Evolvab

entomologyscientific communicationterminology networksActive Inferencecorpus linguisticsCACEsemantic entropymyrmecology

Use Notes

Concise findings and methods pulled from README/SKILL documentation.

Findings / Concepts
  • A six-domain Ento-Linguistic framework (Unit of Individuality; Power & Labor; Behavior & Identity; Sex & Reproduction; Kin & Relatedness; Economics) for analyzing terminological friction in insect research.
  • An open-source, reproducible pipeline: literature mining (PubMed, arXiv), text processing, term extraction and domain assignment, semantic entropy, statistical tests, conceptual network and rhetorical analysis, and CACE scoring.
  • CACE meta-standards (Clarity, Appropriateness, Consistency, Evolvability) as a protocol for lexical engineering and term evaluation.
Methods / Techniques
  • Corpus construction via PubMed and arXiv queries; deduplication and preprocessing (tokenization, lemmatization, n-grams) as specified in the paper and supplemental methods.
  • Term extraction and domain classification via seed lexicons and co-occurrence expansion; semantic entropy via clustered context vectors; pairwise and ANOVA tests with Benjamini–Hochberg correction where applicable.
  • Conceptual overlap networks, bridging-term identification, rhetorical and framing analysis, and per-term CACE scoring.

Citation

Plain-text citation for quick reuse.

Friedman, Daniel Ari. 2026. Ento-Linguistics: Language, Ambiguity, and Scientific Communication in Entomology. Zenodo.

Primary source Documentation BibTeX