# Rational OpenCog Controlled Agent
## Description Rational OpenCog Controlled Agent, or ROCCA, is a project aiming at creating an opencog agent that acts rationally in OpenAI Gym environments (including Minecraft via MineRL and Malmo). At its core it relies on PLN (Probabilistic Logic Networks) for both learning and planning. In practice most of the learning is however handled by the pattern miner, which can be seen as a specialized form of PLN reasoning. Planning, the discovery of cognitive schematics, is handled by PLN and its temporal reasoning rule base. Decision is currently a hardwired module, heavily inspired by OpenPsi with a more rational sampling procedure (Thompson Sampling for better exploitation vs exploration tradeoff). ## Status For now learning is able to 1. Discover temporal patterns based on directly observable events via the pattern miner. 2. Turn these temporal patterns into plans (cognitive schematics). 3. Combine these plans to form new plans, possibly composed of new action sequences, via temporal deduction. The next steps are 1. Add more sophisticated temporal (including dealing with longs lags between cause and effect) and then spatial inference rules. 2. Integrate ECAN, for Attention Allocation, to dynamically restrict the atomspace to subsets of items to process/pay-attention-to. 3. Record attention spreading to learn/improve Hebbian links. 4. Carry concept creation and schematization (crystallized attention allocation). 5. Record internal processes, not just attention spreading, as percepta to enable deeper forms of instrospective reasoning. 6. Plan internal actions, not just external, to enable self-growth. ## Requirements OpenCog tools - cogutil - atomspace (revision b15826224, others have a bug in the pattern matcher) - ure - spacetime - pln - miner - [optional] cogserver - [optional] attention - [optional] opencog Third party tools - Python 3.10 (or Python 3.8 see below) - jupyter notebook - python-orderedmultidict https://pypi.org/project/orderedmultidict/ - fastcore https://fastcore.fast.ai - OpenAI Gym https://gym.openai.com/ - MineRL https://minerl.io - nbdev https://nbdev.fast.ai - black https://pypi.org/project/black/ ### Python 3.10 vs 3.8 Python 3.10 offers a better out-of-the-box type annotation system than Python 3.8 and is thus the default required version. However you may still use Python 3.8 by checking out the [python-3.8-compatible](https://github.com/opencog/rocca/tree/python-3.8-compatible) branch. Beware that such Python 3.8 branch may not be as well maintained as the master. ## Install In the root folder enter the following command: ```bash pip install -e . ``` For the tools used for development: ```bash pip install -r requirements-dev.txt ``` ## How to use An OpencogAgent defined under the `rocca/agents` folder is provided that can used to implement agents for given environments. See the examples under the `examples` folder. There are Jupyter notebooks provided for experimentation as well. To run them call jupyter notebook on a ipynb file, such as ```bash jupyter notebook 01_cartpole.ipynb ``` ### TensorBoard support Some experiments, notably the notebooks, use TensorBoard via the `tensorboardX` library to store event files that show certain metrics over time for training / testing (for now it's just rewards). By default, event files will be created under the `runs/