SERF, Thoughts

Programming from Data

A programming data-driven shouldn’t bother with format first

The user provides a bunch of bits at first, or let some in the memory be of the given input value

Then formats are added to it:
– Simple ones at first, that allow to build simple interpretations such as html,jpg,wav,…
– Growing more complex ones on top of them from more complex logics: tone, color, frequency, face,…

As the formats are stacking, the interpretation gets more details and more complex: the input tends to be fully resolved with infinite heterogeneous formats and infinite time

As formats are numerous, to describe a variety of things and their abstractions, it is helpful to count on a validity domain for each format. That way, automated interpretation can reject some of the formats. Applying a learning orchestration algorithm, structures between those formats can be inferred to identify subsets, similarities or, mainly, accelerate format testing; e.g. orchestrate formats in a bayesian tree according.

Each detected format is linking to a language, with its own set of rules, which will lead to an interpretation. Meaning a projection of the input node to a space constraints by those rules.
Besides correctness of the projection, there can be multiple valid projections in the same set of rules.
For instance, I can say: “The blue car of daddy is really quick” expressing the same fact as “Dad’s new car goes really fast”. It provides different subset of information, different words within the same language rules and conveying the same meaning (as per definition) linking to the same node, for the same format, but interpreting it in two different sentences (suite of expressions).

From these sentences, I get new nodes. By adding format rules regarding the natural language processing, I can get parsed expression from these; a graph of new nodes. And either play with words matching to infer interpretation structure, or develop on the sentence structure from the sentence root or… well, it’s lego time.

Also, even cooler, it gets pointers done right (as data should be contained in its node) and should focus on defining and structuring formats.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s