Cultural Twins

Terminology

Written by manusim | 03-Jul-2025 18:34:27

An essential and often forgotten tool, together with ontologies and taxonomies, to organise and model knowledge

Terminologies, ontologies and taxonomies individually and collectively enhance knowledge organisation and representation by providing structured methods for defining, classifying, and relating information within various domains. And for linguists and computational linguists domains are an absolute corner stone.
 

Defining what these concepts are

1. Terminology

  • Enhances clarity and precision: Terminology involves the use of specialized, precise terms, distinguishing them from everyday words, which is crucial for scientific/technical communication.
  • Promotes consistency and standardisation: It focuses on the consistency and standardization of terminology within specific research areas, such as architectural variables, ensuring that researchers "speak the same language" and reducing ambiguity. This directly supports more effective knowledge organisation.
  • Facilitates knowledge representation and transfer: Terminology plays a vital role in knowledge representation and transfer, especially in multidisciplinary communication.
  • It is applicable in both general and specialized fields, enabling clear communication of concepts.

2. Ontologies

  • Provide structured knowledge models: Ontologies offer an overview of what they are and their role in research domains, enabling the semantic annotation of scholarly publications. This means adding meaning and context to information, making it more discoverable and understandable.
  • Represent complex relationships and contexts: They are designed to represent academic contexts at universities and can provide a framework for understanding concepts and their relationships. For example, they can be used in developing frameworks for risk analysis based on ontological perspectives.
  • Inform research paradigms: Ontologies are integral to research paradigms, influencing research question development and providing a foundational structure for understanding a domain.

3. Taxonomies

  • Enable hierarchical classification: Taxonomies provide hierarchical classification systems. This is fundamental for organising diverse information, such as in biodiversity conservation, ensuring scientific credibility.
  • Support quality assurance and integration: They can be used as taxonomy-based approaches to quality assurance of ontologies, highlighting their role in verifying the structure and consistency of more complex knowledge systems.
  • Taxonomies can also be integrated into ontological frameworks, demonstrating their foundational role in structuring knowledge.

How they interact

The fields of terminology, ontologies, and taxonomies work together to create more precise, structured, and semantically rich knowledge representations.
  • Terminology provides the foundational vocabulary: The precise and consistent vocabulary established by terminology forms the building blocks for more complex knowledge structures.
  • Taxonomies offer hierarchical organisation: Taxonomies provide the necessary hierarchical classification and categorisation of these terms, creating a structured framework for organising information.
  • Ontologies build comprehensive models: Ontologies then build upon this foundation by providing a more comprehensive and explicit representation of concepts, their properties, and intricate relationships, often integrating with taxonomies and utilising precise terminology.
By working in concert, these fields significantly improve how knowledge is organised, represented, and accessed, moving beyond simple data storage to enable deeper understanding and semantic interoperability.
Their synergy is evident in applications such as: Quality assurance in ontology development, where taxonomy-based methods are used to identify errors in ontologies; the challenges and processes of aligning dynamic taxonomies with ontological reasoning in complex fields like biology, showcasing their integrated application for evolving knowledge domains.
 

Terminology as a tool to represent and organise knowledge

Terminology itself functions as a fundamental semantic tool for organising and representing knowledge. The core activities and characteristics of terminology are inherently semantic in nature, aiming to establish clarity, precision, and consistency of meaning.
The very nature and application of terminology implies sematic techniques:
Defining and Clarifying Concepts: Terminology focuses on the concepts of defining term and terminology, distinguishing them from everyday words. This process of defining terms is a primary semantic technique, as it involves explicitly stating the meaning, scope, and boundaries of a specialised concept. This ensures that the terms used in scientific communication are precise and unambiguous, thereby enhancing clarity and precision.
Promoting Consistency and Standardisation: A key aspect of terminology is the pursuit of consistency and standardization within specific research areas. For example, in architectural variables, ensuring consistency in terminology is a semantic technique that reduces ambiguity and ensures researchers "speak the same language". This standardisation is crucial for avoiding misinterpretation and enabling shared understanding across a domain, which is a core semantic goal.
Facilitating Knowledge Representation and Transfer: Terminology plays a vital role in knowledge representation and transfer, especially in multidisciplinary communication. By providing a precise and consistent vocabulary, terminology acts as the foundational building block for constructing more complex knowledge structures like ontologies and taxonomies. This foundational vocabulary is essential for semantically annotating information, making it meaningful and transferable between different contexts or disciplines. Terminology is applicable in both general and specialized fields, enabling clear communication of concepts.
Structuring Specialised Communication: The very use of specialised, precise terms is a semantic strategy to differentiate scientific communication from everyday language. This deliberate choice of terms impacts how scientific articles are understood and cited, with specialised terminology potentially reducing citations if not properly managed or understood.
Hence, the selection and management of terminology are semantic techniques influencing how knowledge is disseminated and consumed.
Reviewing and Evolving Terminology: The continuous review and development of terminology in dynamic fields, such as digital domains, involves semantic analysis of how terms are used and defined over time. This ongoing process is a semantic technique to ensure that the vocabulary remains accurate and relevant to the evolving understanding of a domain.
 

Where are terminologies relevant?

Corpus-based Terminology Extraction and Analysis

Importance:
  • Essential for identifying terminology systematically from domain-specific texts.
  • Supports standardization, consistency, and semantic clarity within specialized fields.
Current Developments:
  • Use of machine learning and deep learning models (e.g., BERT, GPT, transformers) to automate extraction and validation.
  • Incorporation of multi-word terms, contextual embedding, and semantic clustering for accuracy improvements.
Terminological Work Adds:
Precision and accuracy of domain-specific vocabulary, enabling improved NLP downstream applications (translation, indexing, search engines).

Terminology and Ontology Integration

Importance:
  • Crucial for semantic interoperability across information systems, knowledge bases, and ontologies.
  • Enhances semantic search, knowledge discovery, and AI-driven reasoning.
Current Developments:
  • Advances in ontology alignment and semantic mapping via terminological anchors.
  • Development of hybrid models combining linguistic terminological methods and formal ontology engineering (e.g., OWL).
Terminological Work Adds:
  • Robust semantic definitions, facilitating clearer concept relationships and interoperability between diverse ontologies.

3. Terminology Management and Knowledge Organization Systems (KOS)

Importance:
  • Supports knowledge structuring and information retrieval, critical in libraries, digital repositories, and information management frameworks.
  • Provides coherent access points for indexing, categorization, and retrieval.
Current Developments:
  • Integration of terminologies into linked open data (LOD) and SKOS (Simple Knowledge Organization System).
  • Dynamic terminology management systems integrated with semantic web technologies and graph databases.
Terminological Work Adds:
  • Precise and systematic vocabulary facilitating interoperability and accurate retrieval in complex information landscapes.

4. Terminology and Semantic Web Technologies

Importance:
  • Facilitates semantic interoperability in web-scale data integration.
  • Enables machine-readability and structured information exchange on the web.
Current Developments:
  • Advances in terminological data as linked data sets, enabling wider semantic integration and reasoning (e.g., RDF-based termbases).
  • Enhanced methods for ontology-driven terminology validation and maintenance on the semantic web.
Terminological Work Adds:
  • Semantic precision and explicitness, allowing accurate automated reasoning and inferencing across heterogeneous datasets.

5. Terminology and Knowledge Graph Construction

Importance:
  • Provides a foundation for entity linking, semantic disambiguation, and structured knowledge representation in knowledge graphs.
  • Critical for advanced search, recommendation systems, and automated reasoning in AI applications.
Current Developments:
  • Automated techniques using NLP and terminological analysis to populate and refine knowledge graphs.
  • Methods integrating terminological resources (e.g., controlled vocabularies, thesauri) to improve knowledge graph quality and coherence.
Terminological Work Adds:
  • Enhanced accuracy and comprehensiveness of nodes and relationships within knowledge graphs, improving usability for downstream AI tasks.

 

research scope—focusing on legal and machinery/technical/engineering domains, highlighting multilingual corpora, emphasizing first the frameworks, then the computational approaches, and providing concrete real-world use cases. I've also provided two structured tables for quantitative and qualitative methodologies.

Now let's explore 2 research frameworks and see why and how they matter specifically for multilingual legal and engineering contexts:

 

1. Semantic Web Frameworks

  • Relevance:
    Enable explicit semantic definitions and relationships across multilingual terminologies, supporting legal compliance and technical standardization.
  • Example technologies:
    RDF, OWL, SKOS, Lemon-OntoLex.
  • Current computational approaches:
    NLP methods for ontology learning, linking terminologies through semantic alignment, using transformer-based models (BERT) for extracting terms in multiple languages.
Real-World Use Cases:
  1. European Union Legislation Ontology (EuroVoc)
    Multilingual semantic interoperability of EU legal documents.
  2. DBpedia (Multilingual engineering knowledge)
    Multilingual linked data from Wikipedia used in semantic search and AI assistants.
  3. LexVoc:
    Semantic multilingual legal vocabulary supporting EU directives implementation.
  4. Linked Open Terms (LOT)
    Multilingual linked terminologies in mechanical engineering.
  5. ISO Terminology Standards (ISO 1087 and ISO/IEC 81346)
    Structured terminological resources integrated via Semantic Web standards.

2. AI and Knowledge Graph Frameworks

  • Relevance:
    Enhance structured knowledge discovery and semantic representation, particularly in complex legal and engineering fields requiring terminological precision.
  • Example technologies:
    Knowledge graphs, Neo4j graph databases, transformer-based NLP models, large language models (LLMs).
  • Current computational approaches:
    Automatic terminology extraction, semantic disambiguation using embeddings, entity linking, term clustering.
Real-World Use Cases:
  1. Google Knowledge Graph (Technical standards and specifications):
    Multilingual information retrieval for engineering terminologies.
  2. Wikidata (Legal Entities and Terms):
    Multilingual structured data for legal research and semantic referencing.
  3. Engineering Ontology Graph (Siemens):
    Knowledge graphs linking multilingual technical vocabularies for AI-driven diagnostics.
  4. World Intellectual Property Organization (WIPO) Knowledge Graph:
    Multilingual IP terminologies and patent retrieval.
  5. LexisNexis Legal Knowledge Graph:
    Semantic representation of legal terminology and document retrieval across jurisdictions.

 


Quantitative vs Qualitative Methodologies

1. Quantitative Terminological Analysis Methods

Methodology Description Use cases/examples
Frequency Analysis Statistical frequency-based extraction of terms. Patent terminology mining
Co-occurrence Analysis Identifying terms frequently appearing together. Technical manuals corpus analysis
TF-IDF and Statistical Scoring Measures term significance in specific corpora. Multilingual legal text retrieval
Clustering and Term Classification Grouping terms based on semantic similarity. Engineering standards harmonization
Network Analysis Visualization of term relationships. Patent citation terminologies

 

2. Qualitative Terminological Analysis Methods

Methodology Description Use cases/examples
Concept Analysis Semantic clarification and definition of terms Legal ontology alignment (EuroVoc)
Domain Expert Validation Manual verification by specialists Engineering glossaries (ISO committees)
Semantic Feature Analysis Identifying semantic attributes of terms Multilingual legal semantics (LexVoc)
Contextual Semantic Analysis Terms analyzed within textual contexts Technical manuals and specification drafting
Comparative Terminological Studies Contrastive analysis across languages/domains Cross-jurisdictional legal terminology

 

To sum it up, terminology is essential because it facilitates:

    • Precision and Consistency:
      Reduces ambiguity in multilingual legal and technical documentation.
    • Interoperability:
      Facilitates semantic coherence across databases and semantic web frameworks.
    • Enhanced AI Applications:
      Improves accuracy and contextual understanding of NLP systems, knowledge graphs, and semantic search engines.

Which makes translators essential, because terminology work is an essential part of  their training.


  • Key References:

    • L'Homme, M.C. (2020). Lexical Semantics for Terminology: An Introduction. John Benjamins Publishing.

    • Ha, L. Q., & L’Homme, M.C. (2022). "Neural language models and terminology extraction: assessing transformer-based models." Terminology, 28(1), 6-35.

    • Gillam, L., Tariq, M., & Ahmad, K. (2021). "Terminology-driven ontology learning from text." Knowledge-Based Systems, 212, 106595.

    • Costa, R., & Roche, C. (2023). "From terminology to ontology: methodological considerations." Terminology, 29(2), 134-157.

    • Zeng, M. L. (2022). "Interoperability of Knowledge Organization Systems in the linked data environment." International Journal on Digital Libraries, 23, 177–192.

    • Budin, G., & Picht, H. (2022). "Terminology management systems and their integration into knowledge management frameworks." Terminology, 28(2), 143-165.

    • Cimiano, P., McCrae, J., & Montiel-Ponsoda, E. (2020). "Ontology lexicalization: The lemon perspective." Applied Ontology, 15(1), 19-43.

    • Montiel-Ponsoda, E., & Rodríguez-Doncel, V. (2022). "Terminology and the Semantic Web: Integration strategies." Semantic Web, 13(2), 233-251.

    • Ehrlinger, L., & Wöß, W. (2022). "Towards a definition of Knowledge Graphs." Information Systems, 104, 101895.

    • Gracia, J., & Montiel-Ponsoda, E. (2021). "Linguistic Linked Data: Representation, generation and applications." Semantic Web, 12(4), 635-637.