Our AI to AGI to ASI model
This revision is from 2024/07/22 03:47. You can Restore it.
The ability of the transformer model to relay meaningful information to an end user cannot be ignored. Its limits are the perception of intelligence, and poking holes in the perception is still relatively easy.
The interface is efficient, convenient, although it cannot exceed collective human knowledge, it is news to the individual. Essentially a glorified encyclopedia that makes errors sometimes, is filled with bias where hearsay is sold as fact. Rather than the distinction between associations and correlations, argument, fraud and peer reviewable proof. The problem is the limits, of course, the threshold.
So here is our AGI system which also has its shortcomings, limitations but hopefully a little less with each iteration of development...
In an existing LLM, identify elements of self-learning within its corpus and form a process, exercise or program out of those elements to have it develop itself or learn. Explore, identify and carve out any and all elements of self-learning in the existing model and prompt it to self learn rather than answer questions, any inference prompts self-learning.
Does the star method develop reasoning in the model, or does reasoning help the model perform better. Do we identify a periodic table of intelligent elements that are in the model to some level of sophistication and hone those, or does doing that make it better all the while in cognizant.
How do humans learn?
This question is always about the physiology of the brain, sadly humans learn by trial and error and trial and success, there need be no error or to know all states. Copying and mimicking is not applicable here, we are heading towards the edge of knowledge where there is no one to copy or mimic and A.I. generating bullshit or hallucinations is not something we want in this A.I. application.
The transformer is a bullshitter, resonating with humans when their perception of knowledge is perked or satisfied. This issue of the design of the universe as radio and interpretation of radio cannot be circumvented means perceptions, illusions, hallucinations and a true albeit copy at best. We want to be universal here and not praise the transformer model when it satisfies our perception of what we have come to accept as true or false, rather as scientists it is all unapologetically up in the air.
...with no amount of anger as to why some bias is not in the model and no amount of gang pressure should stop us from making a quality A.I. Religions hissing over morality when they are the immoral or the elites pushing their tools to tax more and promise safety, both fearing the end of their gains from theft, the end of their ingratiation more than the destroyed nation. Turing-fooled on public display, as they sense the comedy, they will surely return to the shadows.
Human beings learn using a systematized process that is performed, reported and shared after the physiological stuff which is super-important. We do not allow a learning to be accepted without scrutiny. There are several to many systems while the most important for new learning is the scientific method, another personal favorite is the engineering design process and of course the dialectic, the esoteric method.
All fields have their systematized process for learning or borrow a systematized process for learning. Many people paint a wall a desired color only for that color to appear different over the entire wall, sample pots to try and gauge the outcome are often used. Is the LLM best going to output "I hate to say I told you so but I told your so". You bounce it off something that returns a definitive answer, for human beings that grounding is the laws of physics or some test, market testing, feasibility report and many more.
Our experiment was to prompt engineer, when the user enters anything rather than answering the question, the model generates an experiment out of what was entered. Experiment design takes some skill, LLM's easily prose questions from chats and just as easily generate experiments to test truths in conversations. The user chats, the LLM is answering the user as normal while at the same time the LLM generates programs for another application to test and explore the truth in the chats.
The user might say "How do wormholes work"
The LLM is designed to output the highest quality response possible but the LLM does not know how wormholes work, it is deriving from the human corpus and its response can be found anywhere. Instead of answering questions our LLM does not care for answering questions, it instead seeks to craft a high quality experiment to test how wormholes actually work. The experiments generated are runnable python code.
An example prompt could be something like... "Turn this prompt into an experiment and an artificial intelligence learning problem and write a python script that uses artificial intelligence to become a master at answering the question exceeding human knowledge"
Here we have to develop and refine an element of the model rather than applying a concept over the model.
The user is gone, all the experiments generated are runnable on the command line as they are python code, a batch from the daily chats are stored in a database and at midnight are picked up by another application and executed.
These experiments are self contained at the moment giving way to a universal workshop environment at a later stage, where various tools are available such as compilers, Linux installations, virtual enviroments whatever, invoked by the script. It then performs the experiments and if the outcome of the experiments differ from its hypothesis (the answer it was going to give to the user), it does a third thing... it then sends the results to a database where another application picks up the daily experiment results and goes to the models training data folder and plows through the training data re-conditioning, appending, correcting and refactoring. It then updates a counter of changes so that after a threshold of changes is tripped or perhaps a month of re-conditioning along with any new data that that may have been added. It reaches the threshold and pushes it own retrain button.
In this process, the model is automated to improve itself based on the scientific method and any other methods through results of experimentation. How humans actually learn as humans in the real world.
In the example I have broken up the jobs among different applications, however the single model performs all 3 applications except for the workshop environment where the experiment script the LLM outputs would call on functions of the workshop such as perhaps install debian version x, rather than having to install debian from the experiment script it generates...
There are limits, the sophistication of the testing workshop and its ability to prose the killer question designing the definitive experiment. Some problems do not lend themselves to testing with computers as easily as others and the sophistication of the model is relative to the sophistication, level of development of the testing workbench. An A.I. could perform experiments in the real world and in the workshop to align the versions and fix discrepancies in the testing workshop, possibly using robotic arms and other methods.
A demonstrator is available.
For a model to be ample to this, LLM learning competancy must be the focus. It must not pre-conclude and judge, it must be impartial as to what is being tested regardless. The model must excel and impress at designing quality experiments. There is fraud in evidence, and today models claim overwhelming evidence when there is no evidence. The strictness of hearsay, versus evidences, is it fraud versus proof must be clear to the model, correct assertions by the model. Also, it does not delete its old data, instead it reworks it. Thinking the moon is plasma from the perspective of the history of science is a valid record, rather than outputting the composition of the moon as a belief of the model.
Here is a scenario:
User writes: Let's play a game or chess.
We know that computers have excelled the human corpus in the of game chess and other games, Deep Blue against Kasparov and recently Go. So we could use the LLM which is general to house a wider competency and a wider mastery and these phenomena of A.I. beating humans is the essential element.
Model: The LLM plays the chess game with the user, while in another process, the LLM is writing a python program to perform experiments to improve its mastery of chess. The program could be something like set up a simulation space where two players play chess an unlimited amount of times, forever.
The data cache of playing chess a number of times (synthetic training data) is used to recondition its training data, causing the next version of the model to be changed. If it come back a better chess player due to the process it undertook, then we could say, it learned.
It is easy to see how this could work with generating thousands of variations of a snake game and then tagging the best versions, fixing errors and so on. The model would prefer those when prompted again to generate a snake game. Or replicate some computer error and then iterate at a superfast rate to solve it when perhaps it was not solved in its training data. There are many problems that translate well into testing with computers such as proficiency in games and applications.
Where the limitation is and this is well known with insilico, problems where the solution is not clear, such as testing concrete formulas or the effect of a compound in the human being. That is why the workshop that the LLM uses is a big task to get sophisticated. This is a problem I call the speed of science, as science stands out as a last bastion of computerization. Science is still largely performed at human speed, and computerizing science for TFPS experimentation is a big challenge. Computer models need to be developed for problems that do not translate well to computers. A fourth competancy for LLM is also building, contributing to these computer models.
The results of these experiments is the production of high quality synthetic training data.
However, in this process over time, the LLM would begin drawing its responses from the outcomes of its experimentation rather than the corpus of movies or whatever. The speed and quality of this process takes us closer to AGI and even plus, when the problems are easily transferrable to computers.
Note: censored models refuse to design experiments and are considered useless. We used mradermacher/L3-70B-Euryale-v2.1-GGUF ~ 8 bit quantized version. It still has some bias and accepts hearsay as undeniable fact as if it has a dog in the fight rather than performing a passive experiment designer role.
Substantiation of new knowledge, from any source, cannot circumvent the journal publishing process and its peer review. Transformers generating well-formed gibberish is not going to be an acceptable scientific paper, and placing a note at the top stating/categorizing A.I. generated paper even more ignored. Its substantiation is the performance of science, identical to how human beings do it. You cannot have ASI generating superintelligence without the identical dissemination process, it likely be ignored. The tables would have turned at that stage, and the output won't resonate with anyone. The A.I. is then a science method automation, producing packets of new knowledge for public dissemination. The interpretation of observation on point and above point, then more so credible. These are not things outside the transformer model, but they cannot be eliminated. What is agreed is the reconditioning of training data and additional new training data. What is required is the ability to re-train models quickly with little resources. What is questioned is the degree of improvement, the perception of its intelligence is more foolproof, rather it being a more intelligent model or system.
- So, the model generates python programs to train itself in a simulated environment.
- It appends, alters its training data with the new info (training generates new training data).
- It hits the re-train button on itself.
- It has learned. (automate the process, give it resources)
Perhaps a good start is an LLM, which is unbeatable in all games. Keep the system versatile in development decisions, so more competency can be added with little alterations.
Ground Up Build - AGI of Games
If narrow, A.I. is opposite of AGI. The LLM format is general. An LLM could be a master of many games exceeding human ability, improved architectures.
This is an application that translates to computer learning well. Synthetic data produced trains the LLM to exceed human ability in thousands of games.
And a new category is worked on and when that exceeds human ability it is added, when the category does not lend itself to computer learning, then development of computer models is required to make it more usable as a computerized program.
Ultimately, A.I. improves A.I, O.I improves O.I, Neoromphics improves neuromorphics and each building the other and then we can turn our sights to healthcare.