Definition

An input r\mathcal{r} is composed of a set of information Ir\mathbb{I}_{r} that is represented by a token sequence x={x1,,xn}\mathbf{x}=\{ x_{1},\dots,x_{n} \} where nn is equal to the token length of r\mathcal{r}.

r(Ir,x)r(\mathbb{I}_{r},\mathbf{x})

Gene

A “gene”, gg, can be considered some subset gixg_{i}\subseteq \mathbf{x} such that:

(ϕ(gi)=Ig,i)Ir(\phi^`(g_{i}) = I_{g,i}) \subset \mathbb{I}_{r}

Which in plain English means that we are able to use some set of tokens gg present in x\mathbf{x} to derive a piece of information Ig,iI_{g,i} .

Genotype

A genotype for a prompt is the exact content of the prompt rir_{i} itself. This is to say x\mathbf{x}

Phenotype

Phenotype is the outcome obtained from some prompt. This would be determined to be the oio_{i} that is observed. Now we can have different levels of stringency here, same in biology. As there may be a set oi\mathbb{o}_{i} which denotes the outputs that when decoded produce some target information. For:

oio,ϕ(oi)=Itargeto_{i} \in \mathbb{o}, \hspace{0.1 cm} \phi^`(o_{i})=I_{target}

Mutation notation

A mutation is a perturbation of an input rrmr \to r_{m} where the token sequence xxm\mathbf{x} \to \mathbf{x}_{m}. This means that the information accessible by the prompt is changed, IrIr,m\mathbb{I}_{r} \to \mathbb{I}_{r,m} .