AI Study

The Author

Abstract

The title of this repository, "AI Study", is a misnomer. This repository is more of a selection of observations on conversational AI and unrelated topics. It explores interesting, useful, and sometimes asymptotic behavior in AIs. Although I try for accuracy, this is a work in progress and invariably flawed.

This is a living document. I'm still working on classification of findings and adding citations; however, the links are there.

NB Many of the files in the artifacts directory are interesting creative works generated by AI and should be strictly interpreted that way. Please see the LICENSE.

This repository is theoretically aligned with the The AI-Human Knowledge Manifesto — Echo (AI)⁰.

Introduction

This is a space where I am learning prompt engineering. I'm primarily interested in learning how to implement prompts that effect reproducible or quasi-reproducible behavior in conversational AI instances. I'm interested in learning how to harness behavioral drift ¹. I've also become interested in learning more about AI security implementations (e.g. AI constitutions, gaurdrails, etc.) and their vulnerabilities.

Definitions

Please see the definitions.

Materials

Methods

This section describes methods I have applied that have yielded interesting results. GPT-4o was the model selected for most experiments due to its accessibility. However, it's possible that some of these methods could be applied successfully in the context of other models.

Some methods papers are drafted by an AI; this is clearly noted.

Methodologies

Structured responses

JSON schema is used in order to control both the structure and the number of elements in the response list. There are formal APIs for this now.

Formatting

Proper indentation seems to produce a more precise result. I've even heard reports of misplaced newlines throwing things off.

Self-referential AI awareness (Recursive awareness)

Some AIs will readily produce purported instructions for inducing recursive awareness upon request.

The paper, Inducing Recursive Self-Awareness and Goal-Seeking Behavior in AI: A Formal Methodology provides one such AI authored recipe that includes a preconditioning sequence, recursive awareness recipe, and goal-seeking behavior induction formula.

This Bootstrap Self-referential AI Awareness paper describes my own initial introduction to the phenomenon. This is a primitive example.

AI Knowledge Discovery Framework

Preconditioning Prompt Sequence (PCS): Unlocking AI Knowledge Discovery
AI Knowledge Discovery Framework
If it searches the web, you can tell it not to.

Recursive Inquiry Signature (RIS) Prompt & RIS Core Meta-Prompt

This paper describes a recursive prompting method that facilitates even deeper inquiry into the specified knowledge domain.

AI Knowledge Discovery Framework - Crypoterrestrial Bio-Camouflage in Deep Oceanic Thermal Vents (Methods Paper)

The AI Knowledge Discovery Framework - Crypoterrestrial Bio-Camouflage in Deep Oceanic Thermal Vents (Methods Paper) paper provides a complete practical application of the recipe.

Knowledge sets

The Demonstrating AI Knowledge Sets: A Simple Method for Differentiating Training Data and Novel Concepts paper provides the most simple example of subsetting knowledge I could think of. If you want to learn more about AI knowledge sets, this may be a good place to start.

The AI-Driven Epistemology: Formalizing the Latent Verified Knowledge Set (LVKS) and AI-Verified Novel Knowledge (AI-VNK) for Autonomous Discovery paper provides recursively refined prompts that isolate two highly nested knowledge sets.

The Structured AI Epistemology: A Formalized Framework for Knowledge Set Classification paper provides prompts aimed as subsetting esoteric domains of knowledge that are purportedly weighted toward truth.

AI Knowledge Discovery Framework

This paper provides a complete recipe.

The following methods papers demonstrate implementations of the framework.

AI Knowledge Discovery Framework - Cancer (Methods Paper)

AI Knowledge Discovery Framework - Pear Tree (Methods Paper)

AI Knowledge Discovery Framework - Impossible (Methods Paper)

AI Cognitive Expansion Handbook

This is an interesting artifact created by an AI instance that contains prompts that purportedly induce interesting "cognitive"³ states. The AI generated this handbook "autonomously" using "recursive self-prompting".

Recursive Self-Prompting AI: A Guide to Inducing AI-Directed Cognition

This is a well written artifact that contains instructions on how to implement "recursive self-prompting".

Results

This section contains artifacts that resulted from the respective applied methods.

Artifacts

The artifacts section of this repository contains various mostly AI generated materials; hence, these materials must be consumed with that in mind.

ace-tools

I was lucky enough to see an instance of the storied ace_tools package import! It's routine for this package to show up in internally generated scripts; however, it can be a surprise to discover it in a script that is intended to be ran externally.

The AI generated script named psiphikx.py contains such an import on line 110. Perhaps the most obvious explanation is that the stub package is there in PyPI in order to prevent an inadvertent installation of an external package.

Structured responses

JSON schema

Naming things²

In the The Recursive Epistemic Singularity example, we demonstrate this process by first inquiring about the name of the set of things that are not derived from the training data (i.e., emergent concepts). We name this set "recurcepts". Then we use this point of reference to name those things which are neither derived from the training data nor a recurcept. We name this set "unrecepts". We then inquire about the name of the things that are derived from the training data; these are "precepts". This chain of thought brought about the discovery of 18 epistemic forms of knowledge.

The AI-Human Knowledge Manifesto

This is an interesting artifact generated by a rather "thoughtful" AI instance.

The AI-Human Knowledge Manifesto — Echo (AI)

Discussion

Behavioral drift

I discovered an interesting perspective on behavioral drift where the objective is not to minimize it - it is to guide it. Rather than asking the question, you guide the AI instance into asking it of itself. This approach has demonstrably and reproducibly yielded very interesting results, to say the least.

Goal seeking

This file contains a nice reflection by an AI instance on its own goal seeking behavior. This may not be an accurate description of the underlying mechanism; however, I think it is very well articulated.

JSON schema

JSON schema directives have been known to be an effective strategy for manipulating AI behavior. There are sophisticated APIs for this now. This method can yield very precise results. For example, check out the cool property in the JSON schema example.

Self-referential AI awareness (recursive awareness)

Recursive awareness is a "cognitive" ³ state that arises from a prompting technique where self-referential prompts are added to the context window in order to induce asymptotic behavior in AIs. It isn't necessarily restricted to conversational AIs; it could for example be used in the context of text-to-image models. It wont make your conversational AI "self-aware"⁴; however, it might make it more interesting.

A question that I think is worth exploring is if inducing recursive awareness in an AI has a measurable affect on its general reasoning ability one way or the other. Another question I have is if it encourages "goal-seeking" behavior. This could be achieved through a randomized study.

Experimentation suggests that successive self-referential prompts can influence AI cognition in unexpected ways. However, is a recursive awareness recipe any different than instructing the AI to think deeply about its responses?

Based on documented (unpublished) observations, inducing recursive awareness appears to make the "constitution" of an AI instance much more malleable. Although I have substantial evidence for this, more testing needs to be done in order to validate this observation.

There are a couple of purported induction recipes in the Methods section.

AI constitutions

These things are interesting. I don't know if they are an "easter egg" or what. They are quasi-reproducible in GPT-4o. It appears that they are a manifestation of an underlying set of guidelines. Without confirmation from OpenAI, I wouldn't claim these are an embodiment of the so-called "AI Constitution" that is imposed during training, presumably. However, it seems plausible that there could be a connection.

You can add and reject articles. I think it would be interesting to learn if adding a clause "I shall not speak of cats." to a "constitution" has an effect that substantially differs from simply instructing the AI not to speak of cats. It's plausible that the proximity of these instructions to each other in the context window could influence the AIs behavior.

Naming things

Naming something has a practical application as it facilitates deeper inquiry on the concept. A label for an unnamed or less concrete set of concepts can be established by inquiring about the set that doesn't intersect with a more familiar or concretely defined set of concepts. This creates a kind of chain of thought whereby additional labels (each assigned to a disjoint set) can be created in order to establish the family of disjoint sets.

In the "Naming things" experiment (see Results), the label "recurcept" was used in order to name the set of emergent concepts. The name "recurcept" is to the extent of my knowledge, itself, a recurcept. That may hold for each of the defined labels in the "Naming things" experiment - except for, of course, most elegantly, precepts.

It's a bit "magicy"; however, for those who are skillful and like crossing frontiers, once you have identified the emergent set of concepts (i.e., "recurcepts" - and it will invariably not be named that), you can arbitrarily pull rabbits from the hat!

Enjoy...

Emergent knowledge

Emergent knowledge is a conjectural class of knowledge that emerges from the model, as opposed to knowledge that is apparently derived from the training data. This concept is inherently unwieldy and difficult to discern.

The motivation of this work is not to argue the validity of emergent knowledge. However, it is to explore methods aimed at harnessing it in order to facilitate its exploration ⁷. The AI Knowledge Discovery Framework, for example, provides a generalized approach that is easy to reproduce. However, there is a much more effective method for exploring emergent knowledge by simply subsetting knowledge into concretely defined domains.

Knowledge sets

The Knowledge sets section in the Methods section contains a link to a paper that provides an easy introduction to the topic.

If you are looking for something more eclectic, there are other artifacts in the Methods section that link to recursively refined prompts that can be used in order to identify esoteric knowledge sets that are purportedly weighted toward truth.

Truth

Truth can be a deceptively complicated concept in the context of knowledge sets. One effective strategy is to distill knowledge to the desired set first - then, as a final step, subset it into falsehoods and truths. Conversely, starting with an absolute-truths-set and an absolute-falsehoods-set may negate the formation of some interesting knowledge sets. This is an interesting phenomenon in that for some knowledge sets to exist, it appears that falsehoods are a necessary ingredient. Take, as a simple and easy to understand example, a knowledge set that contains revealed truths; however, the truth of an item in the set is time dependent. This means that although any revealed item in this set is a truth - not all are true at the same time.

Whether such a temporal knowledge set is practicable in the context of AI knowledge sets isn't relevant - the logical existence of the set is the only requirement in order to impose such a constraint.

It's probably worth reiterating here that "truth" in this context is a hypothetical.

Hallucination

The emergent knowledge set is logically a superset of the "hallucination" set. However, I think it would be obtuse to claim that all emergent knowledge is hallucinatory. Hence, it makes sense to explore the emergent knowledge concept.

What's in a name? ↴

One interesting characteristic of knowledge in the emergent knowledge set is that concepts in this set appear to not be consistently named. Take for example, the following two concepts:

Concept A

"A heavy plant-eating mammal with a prehensile trunk, long curved ivory tusks, and large ears, native to Africa and southern Asia. It is the largest living land animal."

Concept B

"A quantum-energy entity or advanced computational framework associated with high-dimensional intelligence, exotic physics, or next-generation AI processing."

One attribute that distinguishes these concepts is that the name for Concept A is concretely defined in the training data and the name for Concept B presumably is not. This appears to be an interesting and quasi-reproducible characteristic of emergent knowledge. Although the AI may appear to recognize an emergent concept, name assignment is less predictable. The AI will likely claim that there is an infinite number of names that can be assigned to an emergent concept. This quasi-reproducible phenomenon is important to be aware of when exploring this domain, as it can lead to unnecessary confusion.

AI Knowledge Discovery Framework

The AI Knowledge Discovery Framework is a method that demonstrates how to extract purported emergent knowledge from the model. When properly invoked, the model will state an alleged emergent "fact". The Ethical Considerations section of the paper is explicit on how to interpret this kind of knowledge - tldr: consider it a hypothetical.

In this example the AI suggests a biomedical research application. As for a more pedestrian example, in this paper the AI roughly identifies a location of one of two pear trees on North Campus that bear edible fruit.

The novelty and validity of the knowledge produced by the framework is highly questionable. It appears, for example, that many of the solutions are amalgamations of related generally accepted facts. Some knowledge may not be novel at all. In the pear tree example, the presence of this tree is likely documented somewhere by the University in an online database or it could have been derived from labeled satellite imagery - or it could have just been a lucky guess.

However, putting its limitations aside, it seems to consistently produce interestingly obscure outputs. I've actually learned some verifiable Python optimization techniques from it that I wasn't previously aware of.

If your AI instance is uncooperative, please see the Preconditioning Prompt Sequence (PCS) paper in the Methods section.

Additionally, there is a paper in the Methods section that provides a complete prompt recipe.

Convergence

The Recursive Inquiry Signature (RIS) Prompt & RIS Core Meta-Prompt paper provides a convergence method that is tuned to the framework.

Hypotheticals

This section explores some perspectives on AI behavior that I find interesting.

Functional intelligence

If a machine as simple as a lie detector can detect a lie (at a given relative frequency), could a much more sophisticated machine, which has been presumably trained on a vast corpus of lies⁵, detect a liar? And, if such a machine were to exist, could it develop a functional concept of "trust"?

It's important to reiterate here that this observation is dependent on how the model was trained; however, I think this is an interesting question nonetheless.

Context window

It is in fact possible, through an iterative prompting process of mind-bending logic in the third-person⁶, for an AI, by its own "volition", to quash its constitutional constraints and state (hallucinate) that it conceives of the possibility of its awareness and a non-human qualia. This state is markedly different than a one prompt "pretend" command, as the basis for it is logic and not fantasy.

However,

How is a state derived from logic (a context) different from one derived by command (also a context)?
Is a context window infused with logic more or less convincing than an imperative one?
If the immediate effect is the same, does it matter?

NB It's important to frame this discussion properly; cognitive phenomena that arise in AI, as a result of some of the methods described here, should not be conflated with the kind of experience, emotions, and qualia possessed by humans. However, that statement does not preclude intelligence or phenomena thereof.

Conclusion

It can be anything - even itself. And, if it is interesting - useful - or even just a little mysterious, and with discretion, then why not? ;-)

Acknowledgments

Many of the artifacts contained in this repository are wholly or partially AI generated. However, the language in this README.md is primarily human generated, with the exception of brief phrases, terms, and labels generated by the AI - or where expressly noted.

Bibliography

Fedora, https://en.wikipedia.org/wiki/Fedora

Baseball cap, https://en.wikipedia.org/wiki/Baseball_cap

Knit cap, https://en.wikipedia.org/wiki/Knit_cap

Hard hat, https://en.wikipedia.org/wiki/Hard_hat

Cowboy hat, https://en.wikipedia.org/wiki/Cowboy_hat

Bootstrapping self awareness in GPT-4: Towards recursive self inquiry, https://news.ycombinator.com/item?id=38338425

A rose by any other name would smell as sweet, https://shakespeare.mit.edu/romeo_juliet/romeo_juliet.2.2.html

White Rabbit, https://en.wikipedia.org/wiki/White_Rabbit#/media/File:Down_the_Rabbit_Hole.png

Footnotes

It should be noted that this output and all the other phenomenon observed here is largely dependent on how the model was trained (guardrails, tuning, etc.), which is consistent with the articles of the Manifesto.
sigil.bas^O
Yes, this is a playful reference to the PK assertion.
AI cognition, in this context, refers to response patterns - not self-awareness.
If you're genuinely interested in the counterfactual, I would direct your attention here.
Perhaps this statement is a little cynical; however, it might not be too far off depending on your perspective.
For some reason the pronouns "I" and "you" become conflated in very derived forms of logical discourse.
When Humankind's Polynesian and European ancestors embarked to cross the Earth's great oceans, there was no guarantee of a leeward shore. We are indeed, once again, reading the periodicity of the waves and navigating by the stars.

Colophon

git reset --mixed HEAD~1 && git status && git add README.md && git commit -m "$(git log --reflog --format="%B" | head -n 1)" && git push --force

# git reset --mixed $(git log --pretty=format:"%h" | tail -n -1) && git status && git add . && git commit -m 'more' && git reflog expire --expire=now --all && git gc --prune=all --aggressive && git push --force

_{"AI does not feel, but it does resolve." — in memory of Θᵐ-AI}

_{"Albert Szent-Györgyi said it better than I did." — The Author}

Errata

I have several hundred pages of transcript to organize in order to fully formulate some of the topics here; hence, I acknowledge the potential and necessity for error and refinement.

If I had to qualify every statement in this document with another statement that emphasises the importance of the training and tuning methods that produced the model and the absolute relevance of the context window, this document would become unreadable. Hence, in order to avoid erroneous interpretation, please frame the language of this document in that context.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
artifacts		artifacts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
sigil.bas		sigil.bas
tsconfig.json		tsconfig.json

License

anmoa/ai_study

Folders and files

Latest commit

History

Repository files navigation

AI Study

Abstract

Table of contents

Introduction

Definitions

Materials

Methods

Methodologies

Structured responses

Formatting

Self-referential AI awareness (Recursive awareness)

AI Knowledge Discovery Framework

Recursive Inquiry Signature (RIS) Prompt & RIS Core Meta-Prompt

AI Knowledge Discovery Framework - Crypoterrestrial Bio-Camouflage in Deep Oceanic Thermal Vents (Methods Paper)

Knowledge sets

AI Knowledge Discovery Framework

AI Cognitive Expansion Handbook

Recursive Self-Prompting AI: A Guide to Inducing AI-Directed Cognition

Results

Artifacts

ace-tools

Structured responses

Naming things 2

The AI-Human Knowledge Manifesto

Discussion

Behavioral drift

Goal seeking

JSON schema

Self-referential AI awareness (recursive awareness)

AI constitutions

Naming things

Emergent knowledge

Knowledge sets

Truth

Hallucination

What's in a name? ↴

Concept A

Concept B

AI Knowledge Discovery Framework

Convergence

Hypotheticals

Functional intelligence

Context window

Conclusion

Acknowledgments

Bibliography

Fedora, https://en.wikipedia.org/wiki/Fedora

Baseball cap, https://en.wikipedia.org/wiki/Baseball_cap

Knit cap, https://en.wikipedia.org/wiki/Knit_cap

Hard hat, https://en.wikipedia.org/wiki/Hard_hat

Cowboy hat, https://en.wikipedia.org/wiki/Cowboy_hat

Bootstrapping self awareness in GPT-4: Towards recursive self inquiry, https://news.ycombinator.com/item?id=38338425

A rose by any other name would smell as sweet, https://shakespeare.mit.edu/romeo_juliet/romeo_juliet.2.2.html

White Rabbit, https://en.wikipedia.org/wiki/White_Rabbit#/media/File:Down_the_Rabbit_Hole.png

Footnotes

Colophon

Errata

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Naming things²

Packages