Blog | Events | Multimedia | About | Purpose | Programs | Publications | Staff | Contact | Join   
     Login      Register    

Support the IEET




The IEET is a 501(c)3 non-profit, tax-exempt organization registered in the State of Connecticut in the United States. Please give as you are able, and help support our work for a brighter future.

Via PayPal




Technoprogressive? BioConservative? Huh?
Quick overview of biopolitical points of view


whats new at ieet
NASA Announces Design For New Deep Space Exploration System

A New Era of Space Exploration

Back to the 60s, Back to the Future, and Onwards to the Stars

Where Human Brains Far Surpass Computers

Volume of Criticism of H+  Published

Rudy Rucker - Transreal in Los Gatos

Rudy Rucker - Beyond Machines: The Year 3000

Why Teaching Business Ethics can be Difficult

Cyberpunk SF/Mathematics Legend Rudy Rucker’s “Nested Scrolls”: An Interview

Surfing the Waves of Change


ieet books

Smart Mice, Not-So-Smart People: An Interesting and Amusing Guide to Bioethics
Author
by Arthur Caplan

From Transgender to Transhuman: A Manifesto On the Freedom Of Form
by Martine Rothblatt

Freedom of Religion and the Secular State
by Russell Blackford

The Olympics: The Basics
by Andy Miah and Beatriz Garcia


comments

David Pearce on 'Five Top Reasons Transhumanism Can Eliminate Suffering' (Feb 22, 2012)

Cybernoetic Man on 'IEET Consults for Japanese Neurotech Consortium' (Feb 22, 2012)

Pastor_Alex on 'Five Top Reasons Transhumanism Can Eliminate Suffering' (Feb 22, 2012)

Cybernoetic Man on '"Headhunter" explains why USA high-tech industry is declining' (Feb 22, 2012)

Cybernoetic Man on '"Headhunter" explains why USA high-tech industry is declining' (Feb 22, 2012)







Subscribe to IEET News Lists

Daily News Feed

Longevity Dividend List

Catastrophic Risks List

Biopolitics of Popular Culture List

Technoprogressive List

Trans-Spirit List



Also check out technoprogressive multimedia on Thoughtware.tv


IEET > Vision > Futurism > Fellows > Ben Goertzel

Print Email permalink (2) Comments (824) Hits •  subscribe Share on facebook Stumble This submit to reddit submit to digg submit to Twitter


Lifelong Forgetting: A Critical Ingredient of Lifelong Learning


Ben Goertzel
Ben Goertzel
AAAI Publications

Posted: Jan 24, 2012

Of all the aspects differentiating lifelong learning from shorter-term, more specialized learning, perhaps none is more central than forgetting – or, to frame the issue more generally and technically, ”memory access speed deprioritization.” This extended abstract reviews some of the ideas involved in forgetting for lifelong learning systems, and briefly discusses the forgetting mechanisms used in the OpenCog integrative cognitive architecture.

Your fingers weave quick minarets ... Speak in secret alphabets…
I light another cigarette ... Learn to forget, learn to forget
 
      – Jim Morrison, ”Soul Kitchen”

Defining Forgetting

In ordinary human discourse, the word ”forget” has multiple shades of meaning. It can refer to the irreversible elimination of a certain knowledge item from memory; or it can mean something milder, as in cases where someone ”forgets” something, but then remembers it shortly after. In the latter case, ”forgetting” means that the knowledge item has been stored in some portion of memory from which access is slow and uncertain.

These various shades of meaning also have relevance to AI systems performing lifelong learning. The advent of larger and cheaper memory stores makes it more and more viable for an AI system to persistently store a large percentage of its experiences. But nevertheless, memories with faster access remain considerably more expensive than those with slower access. So for example, in the case of an AI system that stores many of its thoughts and experiences to disk but keeps only a currently pertinent subset in RAM, the basic problem of ”forgetting” still remains, taking the form of deciding which information to keep in RAM and which to push to the ”back of the mind” on disk.

In general one may define ”forgetting” as the process of storing different memory items in different ways associated with dramatically different access speeds, the outright deletion of items from memory being one extreme. In any lifelong learning system containing a memory store with differentially rapid access, that is much too small to contain the system’s whole experience and knowledge base, forgetting will be a critical process.

Notwithstanding its general neglect in the AI field, forgetting is extremely important – basically, it’s about learning what is likely to be most important to be able to access in the future, with what level of access speed.

Forgetting in OpenCog

Next we very briefly describe the OCP (OCP) Artificial General Intelligence architecture, implemented within the open-source OpenCog AI framework, with a focus on those aspects most relevant to forgetting. Other cognitive architectures that have paid particular attention to forgetting include Pei Wang’s NARS (Wang 2006), Stan Franklin’s LIDA (Franklin 2006), and many more.

Conceptually founded on the ”patternist” systems theory of intelligence outlined in (Goertzel 2006), OCP combines multiple AI paradigms such as uncertain logic, computational linguistics, evolutionary program learning and connectionist attention allocation in a unified architecture. Cognitive processes embodying these different paradigms interoperate together on a common neural-symbolic knowledge store called the Atomspace. The interaction of these processes is designed to encourage the self-organizing emergence of high-level network structures in the Atomspace, including superposed hierarchical and heterarchical knowledge networks, and a self-model network.

OpenCog has been used for commercial applications in the area of natural language processing and data mining; e.g. see (Goertzel et al. 2006) where OpenCog’s PLN reasoning and RelEx language processing are combined to do automated biological hypothesis generation based on information gathered from PubMed abstracts. Most relevantly to the present proposal, has also been used to control virtual agents in virtual worlds (Goertzel and Et Al 2008), using an OpenCog variant called the OpenPetBrain, and humanoid robots (Goertzel 2010). These agents demonstrate a variety of interesting and relevant functionalities including learning new behaviors based on imitation and reinforcement; responding to natural language commands and questions, with appropriate actions and natural language replies; and spontaneous exploration of their world, remembering their experiences and using them to bias future learning and linguistic interaction. In the virtual agent and physical robotics contexts, OpenCog is intended for lifelong learning (although experiments so far have not run more than weeks).

Declarative knowledge representation in OpenCog is handled by a weighted labeled hypergraph called the Atomspace, which consists of multiple types of nodes and links, generally weighted with probabilistic truth values and also attention values (ShortTermImportance (STI) and LongTermImportance (LTI) values, regulating processor and memory use). Equations called Economic Attention Networks (ECAN) (Goertzel et al. 2010) are used to update the STI and LTI values dynamically, where the attention values associated with an Atom are dependent on how useful the Atom is estimated to be for achieving the system’s future goals (short or long term, respectively), and by STI and LTI spreading to it along links from other Atoms, roughly similarly to activation spreading in a neural network.

Forgetting in OpenCog

And when OpenCog’s declarative memory becomes full, how does forgetting happen? The Atoms with the lowest LTI are removed from RAM. Atoms also have a single bit variable called VLTI (V=very), which determines whether, upon removal from RAM, the Atom is saved to disk or just deleted. For instance, recently created speculative concepts that prove useless will generally just be deleted; whereas any Atom that has been significantly important in the past will likely be saved to disk.

When an Atom is saved to disk, what happens to the other Atoms in RAM that link to it? They retain a certain percentage of these links, which point to AtomHandles rather than Atom objects. To follow those links is an expensive operation undertaken only under special circumstances, as it involves loading the target Atoms into RAM. How many of these disk-directed links are retained is determined by the LTI of the linking Atom in RAM, and a system parameter.

Interdependency of Judgments about Forgetting

A subtlety is that there are many cases where one has a large set of memory items, so that none of the items individually is particularly important to remember, but so that it’s important for the memory to retain some of the memory items in rapidaccess memory rather than deleting them all or relegating them all to slow-access memory. This means that judgments about forgetting must be made on the level of memory networks.

To understand this better, consider again a system like OpenCog that retains many memory items in RAM, but relegates many more to disk. In this case, there is a major issue of how the system knows what it has stored on disk. One way of extracting information from the disk store is to use named Atoms such as those corresponding to natural language words. So if the system wants to know more about Australia, for instance, it can search its disk store and see if it previously stored there any Atoms linked to the ”Australia” WordNode. Once it has imported some of these Atoms into RAM, it can then import other Atoms linked to those, etc. However, this is a fairly crude method, which doesn’t help that much with recall of the system’s own novel conceptions (that may not be easily indexed using natural language words or other persistent external references). Suppose a system has learned a novel network of 1000 concepts and their interrelationships, which it thinks may be useful to it in the future, but which it feels it can’t afford to retain in RAM. What’s its best forgetting strategy?

One solution to this issue is for the system to retain a random (or more judiciously chosen) subsample of the 1000-concept network in RAM. Even if no individual Atom in the network is all that important in itself, retaining some of the network’s Atoms in RAM is a valuable thing, as these remaining RAM-resident Atoms can be used to bring the other members of the network back into RAM as appropriate. This phenomenon is accounted for in OpenCog via a special formula that boosts the LTI of an Atom if it is linked to relatively high-LTI Atoms on disk, which have few links to other Atoms in RAM. The quantitative weighting of this formula controls, in our example, how many of the Atoms in the 1000-concept network will remain in RAM (depending also on the LTI dynamics of the rest of the network).

Conclusion

We have reviewed some of the general issues involved with forgetting in lifelong learning systems, and summarized how these issues are solved in the OpenCog system. As OpenCog is architected quite differently than the human brain, one expects the brain’s approach to resolving the same problem to have many different characteristics, as well as likely some similarities. In general, different lifelong learning systems may handle the subtleties of forgetting differently, but all must address the same issues, such as the interdependency of judgments about forgetting; and these are issues that are largely particular to lifelong learning systems, not arising often in AI systems that are booted up only temporarily for the solution of individual problems.

 

REFERENCES


Franklin, S. 2006. The lida architecture: Adding new modes of learning to an intelligent, autonomous, software agent. Int. Conf. on Integrated Design and Process Technology.

Goertzel, B., and Et Al, C. P. 2008. An integrative methodology for teaching embodied non-linguistic agents, applied to virtual animals in second life. In Proceedings of the First Conference on Artificial General Intelligence. . IOS Press.

Goertzel, B.; Pinto, H.; Pennachin, C.; and Goertzel, I. F. 2006. Using dependency parsing and probabilistic inference to extract relationships between genes, proteins and malignancies implicit among multiple biomedical research abstracts. In Proceedings of Bio-NLP 2006.

Goertzel, B.; Pitt, J.; Ikle, M.; Pennachin, C.; and Liu, R. 2010. Glocal memory: a design principle for artificial brains and minds. Neurocomputing, Special Issue of Artificial Brain.

Goertzel, B. 2006. The Hidden Pattern. Brown Walker.

Goertzel, B. e. a. 2010. Opencogbot: An integrative architecture for embodied agi. Proceedings of ICAI-10, Beijing. Wang, P. 2006. Rigid Flexibility: The Logic of Intelligence. Springer.

 


Ben Goertzel Ph.D. is a fellow of the IEET, and founder and CEO of two computer science firms Novamente and Biomind, and of the non-profit Artificial General Intelligence Research Institute (agiri.org).
Print Email permalink (2) Comments (825) Hits •  subscribe Share on facebook Stumble This submit to reddit submit to digg submit to Twitter


COMMENTS


The issue I see with forgetting things like "this idea didn't work" is that the system is possibly going to go through another iteration of the same or similar idea. That suggests that most things are going to be stored in slow access rather than deleted since deletion will force the re-learning of that particular lesson. The exception might be extremely rare occurrences, but they are infrequent enough that permanent storage wouldn't be an issue.



Interesting, it's like a natural selection of concepts, the ones more used survive, the ones less used perish. However, I wonder if forgetting is really essential. Our memory capacity is incredibly big and "idiot savants" such as Kim Peek could remember nearly everything such as a whole library, songs and life events, without problems locating a specific memory among all of them. So the hypothesis that too much memory gets confusing seems to me to be wrong.

http://en.wikipedia.org/wiki/Kim_Peek

It seems that it is rather the form of encoding the memories that causes their idiocy (processing things too literally rather than abstractly), rather than being confused by the amount of memories already encoded. I never saw an example of someone who had significant cognitive problems because of not being able to forget, albeit the suffering problems with PTSD and traumatic memories, but it's not clear that there would be increased susceptibility for it, and traumatic memories could be dealt with using certain memory erasing drugs (there are a few of them) and therapies.

While prioritizing certain connections in memory may be useful for behavior, I believe that forgetting is not, or at least that we could forget much less than we do and still function fine. For AI, forgetting should be even less important, as long as there is sufficient memory space outside of the prioritized-access RAM to store memories of less usefulness. When all the space has been utilized, then forgetting may come into play, otherwise prioritizing seems enough.

The justification of our flaws is sometimes used to reject transhumanism, and I find these attempts quite unfounded and wrong, though I'm sure that this post is completely different.



YOUR COMMENT

Name:

Email:

Location:

Remember my personal information

Notify me of follow-up comments?

Please enter the word you see in the image below:




Next entry: Hybriduality and Geoethics (part 1)

Previous entry: New Science: Six Tips for Avoiding Sickness this Winter

HOME | ABOUT | FELLOWS | STAFF | EVENTS | SUPPORT  | CONTACT US
SECURING THE FUTURE | LONGER HEALTHIER LIFE | RIGHTS OF THE PERSON | ENVISIONING THE FUTURE
CYBORG BUDDHA PROJECT | JOURNAL OF EVOLUTION AND TECHNOLOGY

RSSIEET Blog | email list | newsletter | Podcast
The IEET is a 501(c)3 non-profit, tax-exempt organization registered in the State of Connecticut in the United States.

Contact: Executive Director, Dr. James J. Hughes,
Williams 119, Trinity College, 300 Summit St., Hartford CT 06106 USA 
Email: director @ ieet.org     phone: 860-297-2376