The most enigmatic aspect of consciousness is the fact that it is felt, as a subjective sensation. This particular aspect is explained by the theory proposed here. The theory encompasses both the computation that is presumably involved and the way in which that computation may be realized in the brain's neurobiology. It is assumed that the brain makes an internal estimate of an individual's own evolutionary fitness, which can be shown to produce an irreducible, distinct cause. Communicating components of the fitness estimate (either for external or internal use) requires inverting them. Such inversion can be performed by the thalamocortical feedback loop in the mammalian brain, if that loop is operating in a switched, dual-stage mode. A first (nonconscious) stage produces forward estimates, whereas the second (conscious) stage inverts those estimates. It is argued that inversion produces irreducible, distinct, and spatially localized causes, which are plausibly sensed as the feeling of consciousness.
A system with artificial intelligence usually relies on symbol manipulation, at least partly and implicitly. However, the interpretation of the symbols - what they represent and what they are about - is ultimately left to humans, as designers and users of the system. How symbols can acquire meaning for the system itself, independent of external interpretation, is an unsolved problem. Some grounding of symbols can be obtained by embodiment, that is, by causally connecting symbols (or sub-symbolic variables) to the physical environment, such as in a robot with sensors and effectors. However, a causal connection as such does not produce representation and aboutness of the kind that symbols have for humans. Here I present a theory that explains how humans and other living organisms have acquired the capability to have symbols and sub-symbolic variables that represent, refer to, and are about something else. The theory shows how reference can be to physical objects, but also to abstract objects, and even how it can be misguided (errors in reference) or be about non-existing objects. I subsequently abstract the primary components of the theory from their biological context, and discuss how and under what conditions the theory could be implemented in artificial agents. A major component of the theory is the strong nonlinearity associated with (potentially unlimited) self-reproduction. The latter is likely not acceptable in artificial systems. It remains unclear if goals other than those inherently serving self-reproduction can have aboutness and if such goals could be stabilized.
Purpose and meaning are necessary concepts for understanding mind and culture, but appear to be absent from the physical world and are not part of the explanatory framework of the natural sciences. Understanding how meaning (in the broad sense of the term) could arise from a physical world has proven to be a tough problem. The basic scheme of Darwinian evolution produces adaptations that only represent apparent ("as if") goals and meaning. Here I use evolutionary models to show that a slight, evolvable extension of the basic scheme is sufficient to produce genuine goals. The extension, targeted modulation of mutation rate, is known to be generally present in biological cells, and gives rise to two phenomena that are absent from the non-living world: intrinsic meaning and the ability to initiate goal-directed chains of causation (active causation). The extended scheme accomplishes this by utilizing randomness modulated by a feedback loop that is itself regulated by evolutionary pressure. The mechanism can be extended to behavioural variability as well, and thus shows how freedom of behaviour is possible. A further extension to communication suggests that the active exchange of intrinsic meaning between organisms may be the origin of consciousness, which in combination with active causation can provide a physical basis for the phenomenon of free will.