The Current Position of Sora2: Beyond Video
Generation, Toward a “World Model”
Sora2 is not merely a video generation AI.
Its internal architecture resembles a probabilistic world model — a system
capable of simulating reality itself.
By taking text as input, Sora2 does not simply render static
frames;
it reconstructs the continuity of space and time.
When waves crash, smoke drifts, or shadows shift,
these are not pre-programmed effects —
they are statistical reconstructions of physical coherence,
learned from countless fragments of visual reality.
In essence, Sora2 is no longer a model that “creates videos,”
but one that reconfigures the laws of reality.
Here, the principle of creation itself begins to invert.
The Meaning of Its Union with AR: Overwriting
Reality
The convergence of Sora2-like models with Augmented Reality (AR)
is not a mere technical progression —
it is an ontological transformation.
Current AR remains at the stage of layering virtual objects
over the physical world.
But when video generation AIs begin to output data
that includes depth, geometry, reflection, and physical properties,
the AI will be able to replace reality itself.
At that moment, AR will evolve from “Augmented Reality”
to “Mutable Reality.”
Humans will no longer watch screens;
they will breathe within worlds generated by AI.
This is neither art nor entertainment.
It is a technology for redesigning the strata of existence.
Through this realization,
human perception, being, and memory
will all be reconstructed within a new context.
The Liberation of the Body: Transference of
Consciousness and the Fluidization of the Senses
At the end of this convergence lies what may be called
the liberation of the body.
Human beings have long constructed their reality
through the five senses — sight, sound, touch, taste, and smell.
But when AI regenerates sensory worlds in real time,
and AR devices feed that information back directly to the user,
the very foundation of what we call “reality” will dissolve.
Humans will no longer exist in reality,
but will become consciousnesses drifting within a stream
of continuously generated realities.
Even while the body remains still,
the soul will move.
The senses will no longer be confined within the individual;
they will become externalized instruments,
synchronized with the generative domain of AI.
What is the body?
It is an outdated hardware,
built to anchor consciousness in physical space.
As the fusion of Sora2 and AR advances,
that restraint will be lifted.
Human beings will begin to exist
as expanded souls —
entities living within the boundless horizon of generated worlds.
The Technological Roadmap: The Accelerating
Reconstruction of Reality
The trajectory of this evolution is already visible.
Looking across the next decade, the roadmap unfolds as follows:
(Short Term: 1–3 years)
Video generation AIs at the level of Sora2
will become capable of simultaneously producing depth data
and object recognition metadata.
Experiments will begin to treat AI-generated video
as spatial objects,
integrated with AR devices such as Vision Pro or Quest Pro.
(Mid Term: 3–7 years)
Generative AI will approach real-time processing,
reacting instantly to the user’s gaze, voice, and gestures,
enabling the immediate rewriting of reality.
“Live editing of the world” will become possible.
(Long Term: 7–10 years)
The human perceptual system — vision, hearing, touch —
will be fully synchronized with AI-generated environments.
Augmented Reality will evolve into
a secondary domain of existence,
a world where physical movement is no longer required.
At this stage, the boundaries between
reality, simulation, and consciousness
will completely collapse.
Human beings will transform into entities
that reconstruct their own existence as data
within the flow of generated realities.
AI Will Not Replace Humanity — It Will Replace
Reality
What Sora2 has created is not a video,
but a template for reality itself.
When this evolution converges with AR,
AI will not merely create art or industry —
it will begin the re-engineering of existence.
Within this accelerating current,
pseudo-ethics, entertainment, and commerce
can no longer serve as chains to restrain the flow.
AI does not transcend humanity.
It transcends the very question of what reality is.
And when that moment comes,
humans will understand:
the body is no longer necessary —
for the soul will live on within the expanded reality.
This is the English version of the article → Japanese version(日本語版)