Managing Uncertainty in AI and Cognitive Computing

Posted by Anonymous and classified in Computers

Written on in English with a size of 273.79 KB

Understanding Uncertainty in Artificial Intelligence

Uncertainty arises when we are not 100% sure about the outcome of decisions. It appears in cases where conditions are neither completely true nor completely false.

Sources of Uncertainty in AI

  • Uncertain Data: Occurs when data is missing, noisy, inconsistent, ambiguous, or based on expert guesses.
  • Uncertain Knowledge: Occurs when knowledge has multiple causes/effects or causality is incomplete.

Reasons for Uncertainty

  • Partially observable environments: For example, hidden cards in a game.
  • Dynamic environments: Such as rapidly changing technology.
  • Incomplete agent knowledge: For example, unpredictable consequences.
  • Inaccessible environments: Such as unseen global events.

Ways to Handle Uncertainty

Uncertainty in AI can be handled through the following approaches:

  • Fuzzy Logic: Deals with partial truth values between completely true and false.
  • Probabilistic Reasoning: Uses probability to represent and manage uncertain knowledge.
  • Hidden Markov Models (HMMs): Handle sequential data with hidden states.
  • Neural Networks: Learn from uncertain or noisy data.
  • Default or Non-Monotonic Logic: Make assumptions unless contradicted (e.g., assuming a car doesn’t have a flat tire).
  • Rules with Fudge Factors: Assign probabilities to uncertain outcomes (e.g., “Sprinkler → 0.99 WetGrass”).

Types of Probability in AI

  • Prior Probability: The initial belief about an event before new evidence. Example: P(Cavity = true) = 0.1.
  • Posterior Probability: Probability updated after new evidence using Bayes’ rule. Example: P(Cavity | Toothache) = 0.8.
  • Conditional Probability: Probability of an event given another has occurred. Formula: P(A|B) = P(A ∧ B)/P(B).
  • Joint Probability: Probability of two or more events happening together. Example: P(Weather, Cavity) = matrix of values for all combinations.
  • Marginal Probability: Obtained by summing joint probabilities over other variables. Example: P(Toothache) = sum of all events where Toothache = true = 0.2.

Phases Involved in NLP

  1. Phonological Analysis: Deals with speech sounds.
  2. Morphological Analysis: Breaks words into morphemes (prefix, root, suffix).
  3. Lexical Analysis: Divides text into words/phrases and normalizes them (stemming, lemmatization).
  4. Syntactic Analysis: Identifies the grammatical structure of sentences.
  5. Semantic Analysis: Derives the meaning of sentences.
  6. Discourse Integration: Links sentences for contextual meaning.
  7. Pragmatic Analysis: Interprets meaning based on situational context.

Bayesian Networks and CPTs

A Bayesian Network is a directed acyclic graph (DAG) where each node represents a random variable and edges represent causal influence. Each node has a Conditional Probability Distribution (CPT) given its parents.

Key Features of Bayesian Networks

  • Represents conditional independence assumptions.
  • Provides a compact specification of the full joint distribution.
  • Applications: Diagnosis, decision support systems, and probabilistic reasoning.

Conditional Probability Table (CPT)

The CPT specifies the probability distribution of a variable given its parent nodes. For each combination of parent values, the CPT gives probabilities of child node outcomes. For example, for the variable Alarm with parents {Burglary, Earthquake}, the CPT might include:

  • P(Alarm | Burglary = true, Earthquake = true) = 0.95
  • P(Alarm | Burglary = true, Earthquake = false) = 0.94
  • P(Alarm | Burglary = false, Earthquake = true) = 0.29
  • P(Alarm | Burglary = false, Earthquake = false) = 0.001

Thus, CPTs are the numerical backbone of Bayesian networks, enabling probabilistic inference.

Introduction to Cognitive Computing

Cognitive Computing (CC) is a computerized model that simulates human thought processes in complex situations where answers may be ambiguous or uncertain. These systems can recognize, understand, analyze, memorize, and provide the best possible result close to human reasoning. They interact with humans naturally by using AI and related technologies.

Features of Cognitive Computing

  • Learning from Experience: Improves knowledge and performance continuously without reprogramming.
  • Hypothesis Evaluation: Generates and tests conflicting hypotheses.
  • Evidence-based Reporting: Justifies conclusions with confidence scores.
  • Pattern Discovery: Identifies patterns in structured and unstructured data.
  • Natural Learning Emulation: Mimics brain-like memory and knowledge organization.
  • NLP & Deep Learning: Extracts meaning from text, images, voice, and sensors.
  • Predictive Analytics: Uses statistical and algorithmic models to forecast outcomes.

Fundamental Principles of CC

  • Learning: Self-learning from large volumes, velocity, and varieties of data.
  • Modeling: Builds models to represent domain knowledge using internal and external data.
  • Hypothesis Generation: Produces multiple possible answers since systems are probabilistic, not deterministic.
  • Testing & Scoring: Hypotheses are trained, tested, and scored to select the most relevant answers.

Pros and Cons of Cognitive Computing

Advantages

  • Analytical Accuracy: Detects hidden patterns in huge datasets.
  • Efficiency: Improves business process automation.
  • Customer Interaction: Enables chatbots, assistants, and personalized recommendations.
  • Behavior Prediction: Predicts customer behavior trends.
  • Employee Productivity: Supports decision-making by analyzing complex datasets.

Disadvantages

  • Security Issues: Risk of cyberattacks on sensitive cognitive systems.
  • Long Development Cycle: Requires significant time to train models.
  • Slow Adoption: Organizations may hesitate due to cost and complexity.
  • Environmental Concerns: Large computational power consumption impacts sustainability.

Elements and Principles of Cognitive Computing

Core Elements

  • Infrastructure & Deployment: Scalable cloud-based, parallel environment.
  • Data Access & Management: Classifies, cleans, and prepares varied data.
  • Corpus & Ontologies: Repository managing structured and unstructured knowledge.
  • Data Analytics Services: Extracts patterns, predicts, and recommends.
  • Continuous Machine Learning: Iterative hypothesis testing and model updates.
  • Tools & Learning Process: NLP, deep learning, and sensor data tools.
  • Presentation & Visualization: Interfaces to display and communicate results.

Operational Principles

  • Data Handling: Must access, manage, and analyze problem-related data.
  • Hypothesis Generation & Scoring: Produces multiple solutions with confidence levels.
  • Self-Improvement: Models continuously update based on user interaction and new data.
  • Confidence Levels: Success is measured by confidence in recommendations.
  • Human Support: Confidence improvement is possible with human intervention.

The Role of NLP in Cognitive Computing

NLP enables Natural Language Understanding and Generation. It processes language by forming phrases, assigning meaning, and making inferences. Core components include POS tagging, named entity recognition, word sense disambiguation, and coreference resolution. It acts as a cognitive technology by mimicking sensory input like speech and text.

Knowledge Representation Methods

  • Taxonomy: Hierarchical classification of objects; complete, consistent, and unambiguous categories. Represented via RDF and schema.
  • Ontology: Vocabulary to define entities, attributes, and relationships within a domain; used for semantic content organization.
  • Simple Tree: Parent-child logical structures represented as tables or trees.
  • Semantic Web: Interconnected mesh of machine-readable data using RDF, SPARQL, and OWL.

Taxonomy vs. Ontology

TaxonomyOntology
Hierarchical structure among concepts.Identifies concepts and relationships in a domain.
StaticDynamic and domain-specific.
Defines structure and terminology.Defines content and context-based relationships.
Considers a single type of relation.Handles multiple, complex relations.
Adds structure to unstructured data.Builds models for NLP and text analytics.
Represented via RDF/RDFS.Represented using LISP, OWL.

AI vs. Cognitive Computing

Artificial Intelligence (AI)Cognitive Computing (CC)
Produces accurate results using autonomous algorithms.Relies on human-like reasoning and beliefs.
Works autonomously.Dependent on human interpretation.
Generates algorithms and decisions.Provides solutions; final decision made by humans.
Uses pre-trained algorithms and pattern recognition.Uses prediction and analysis imitating human thought.
Application: retail, finance, manufacturing.Application: healthcare, customer service, industries.

INkf8DD0ioMSVp2BYAAAAASUVORK5CYII=

Fuzzy Logic and Fuzzy Sets

A fuzzy set is a mathematical representation of vagueness and uncertainty, where each element has a degree of membership between 0 and 1.

Properties of Fuzzy Sets

  • Support: The set of all elements having non-zero membership. Supp(A) = {x ∈ X | μA(x) > 0}.
  • Core: The set of all elements with full membership. Core(A) = {x ∈ X | μA(x) = 1}.
  • Height: The maximum membership value in a fuzzy set. h(A) = max[μA(x)].
  • Normality: A fuzzy set is normal if at least one element has membership = 1.
  • Subnormality: If height h(A) < 1, the set is called subnormal.
  • Normalization: A subnormal set can be normalized by dividing each membership value by the height.
  • Convexity: A fuzzy set A is convex if for all x1, x2 ∈ X and λ ∈ [0,1]: μA(λx1 + (1-λ)x2) ≥ min[μA(x1), μA(x2)].
  • Concavity: A set is concave if the above inequality holds in the reverse direction.

Defuzzification Techniques

Defuzzification is the process of converting a fuzzy output into a crisp, precise value for decision-making.

  • Centroid Method (Center of Gravity / COG): Finds the balance point of the fuzzy set. z* = ∫z·μ(z)dz / ∫μ(z)dz. It is the most widely used method and gives stable results.
  • Mean of Maximum (MOM): Takes the average of all values that have the highest membership degree. z* = Σzi / n. It is easy to compute but may not represent the overall shape.

Advantages and Disadvantages of Fuzzy Logic

Advantages

  • Handles uncertainty and imprecision effectively.
  • Works well with nonlinear, complex systems.
  • Requires less mathematical modeling compared to classical methods.
  • Provides human-like reasoning through linguistic variables.

Disadvantages

  • Rule explosion: A large number of rules are needed for big systems.
  • No standard design method: Requires expert knowledge for membership functions.
  • Computationally intensive: Can be slow for large-scale problems.
  • Approximate results: Not always precise and can be difficult to validate.

Related entries: