Bridging 15+ years of scalable SaaS experience with 18 years of AI research.
I’ve always been fascinated by intelligent systems. Long before AI was mainstream, I was experimenting with neural networks and genetic algorithms and building hybrid prototypes. With 15+ years of SaaS engineering and a Master’s in AI, I now bridge research-level AI with production-grade software practices.
In the traditional theater of retail trading, the protagonist is often a single visual cue: the moving average crossover. This "blue line crossing the red line" represents a seductive simplicity, suggesting that the ocean of global liquidity can be distilled into a reactive geometry. Yet, for any practitioner who has faced a sideways market or a volatility spike, the flaw is evident. These systems are purely reactive, processing what has already happened without ever grasping the underlying "physics" that governs the movement.
The GPTCN (Generative Predictive Temporal Convolutional Network) system represents a fundamental architectural departure. It does not treat the market as a series of buy/sell signals to be memorized. Instead, it functions as a self-taught philosopher-scientist, constructing a proprietary internal model of market dynamics. By shifting from reactive observation to conceptual understanding, the system seeks to decode the structural integrity of a trend before a single trade is ever executed.
To a sophisticated neural network, raw price data is "terrible data." It is non-stationary; $180 for a share of Apple today carries a fundamentally different statistical weight than it did five years ago. To solve this, the GPTCN discards raw prices entirely in favor of a successive refinement of signal.
The pipeline begins by deriving stationary return curves from the raw price series. These curves are then fed into a Discrete Cosine Transform (DCT) using a rolling 10-day window.
This process transforms the market into a "spectral fingerprint." Like an audio equalizer, it separates the market’s "bass" (heavy, slow-moving trends) from its "treble" (high-frequency intraday noise). As the technical documentation notes, this approach provides a "richer, much more stable input" than price alone, allowing the AI to analyze the "texture" of volatility rather than the erratic fluctuations of a dollar amount.
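The rolling-DCT step can be sketched as follows. This is an illustrative approximation, not the GPTCN source: the function name, the use of log returns as the "refined" input, and the orthonormal DCT-II variant are all assumptions; only the 10-day rolling window comes from the text.

```python
import numpy as np
from scipy.fft import dct

def spectral_frames(prices, window=10):
    """Turn raw prices into rolling DCT 'spectral fingerprints'.

    Converts non-stationary prices to log returns, then applies a
    Discrete Cosine Transform over a rolling 10-day window.
    Names and details are illustrative, not from the GPTCN codebase.
    """
    returns = np.diff(np.log(prices))  # stationary-ish input, not raw dollars
    frames = np.lib.stride_tricks.sliding_window_view(returns, window)
    # DCT-II per window: coefficient 0 is the "bass" (slow trend),
    # higher coefficients are the "treble" (fast intraday noise).
    return dct(frames, type=2, axis=1, norm="ortho")

prices = np.cumsum(np.random.default_rng(0).normal(0.001, 0.01, 260)) + 100
frames = spectral_frames(prices, window=10)
print(frames.shape)  # (250, 10): one 10-coefficient fingerprint per day
```

Each row is one "spectral fingerprint"; a downstream network consumes these frames instead of prices.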
At the core of the GPTCN is the Joint Embedding Predictive Architecture (JEPA), a concept pioneered by Yann LeCun. In computer vision, JEPA is used to teach machines "object permanence"—the understanding that a car driving behind a building has not vanished, but continues to exist with a predictable trajectory.
GPTCN applies this to financial time series through a process known as "masking." However, unlike standard models that attempt to predict the exact value of a hidden data point—a task akin to transcribing a distorted audio recording—JEPA performs a "Mad Libs" exercise. It hides a chunk of the frequency data and tasks the AI with predicting the concept or context of what belongs in that gap. This forces the architecture to build an internal representation of how market patterns flow together, prioritizing conceptual integrity over the "fool’s errand" of guessing a closing price down to the penny.
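The key JEPA idea, predicting in embedding space rather than value space, can be shown with a deliberately tiny sketch. Everything here is a stand-in: the encoders are random linear maps instead of deep networks, and the mask location, dimensions, and pooling are invented for illustration. The point is only where the loss lives.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: a "sequence" of 20 frequency frames, 10 coefficients each.
frames = rng.normal(size=(20, 10))
mask = slice(8, 12)                     # hide a contiguous chunk

# Stand-ins for the context encoder and the predictor
# (random linear maps here; in the real system these are deep networks).
W_enc = rng.normal(size=(10, 16))
W_pred = rng.normal(size=(16, 16))

visible = frames.copy()
visible[mask] = 0.0                     # the "Mad Libs" blank

context_embedding = visible.mean(axis=0) @ W_enc      # what the model sees
target_embedding = frames[mask].mean(axis=0) @ W_enc  # concept of the gap

# JEPA-style loss: match *embeddings* of the masked region,
# never the raw frequency values themselves.
prediction = context_embedding @ W_pred
latent_loss = np.mean((prediction - target_embedding) ** 2)
print(latent_loss)
```

Because the target is an embedding, the model is graded on capturing the concept of the missing chunk, not on transcribing its exact numbers.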
A primary risk in deep learning is the "monolith" problem, where a model develops a singular blind spot. The GPTCN counters this by encoding its understanding into a dense 64-dimensional vector, which is then physically bifurcated into four independent 16-dimensional "slots" or specialists.
Each specialist is responsible for producing its own independent 16-dimensional reading of the market.
To prevent these specialists from gravitating toward the same obvious trend, the system utilizes Prototype Repulsion Loss. This is a mathematical cage match: if the internal models of two specialists become too similar, the system heavily penalizes them. By forcing their vectors to remain orthogonal, the architecture mandates a diversity of thought, ensuring that if one slot focuses on momentum, another is mathematically required to look elsewhere, such as toward volume shocks or volatility clusters.
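A minimal sketch of this repulsion idea, assuming a simple form for the penalty (sum of squared pairwise cosine similarities between slots); the 64-dimensional vector and the 4×16 split come from the text, the exact loss function does not:

```python
import numpy as np

def prototype_repulsion_loss(embedding_64d):
    """Penalize similarity between the four 16-d specialist slots.

    Splits the 64-d embedding into four 16-d slots, then sums the
    squared cosine similarities between every pair of slots, so
    identical specialists are punished hardest. The exact loss form
    is an assumption, not the GPTCN source.
    """
    slots = embedding_64d.reshape(4, 16)
    unit = slots / np.linalg.norm(slots, axis=1, keepdims=True)
    sim = unit @ unit.T                       # pairwise cosine similarity
    off_diag = sim[~np.eye(4, dtype=bool)]
    return np.sum(off_diag ** 2)

rng = np.random.default_rng(0)
clones = np.tile(rng.normal(size=16), 4)      # four identical specialists
diverse = rng.normal(size=64)                 # four independent specialists
print(prototype_repulsion_loss(clones) > prototype_repulsion_loss(diverse))  # True
```

Cloned specialists score the maximum penalty, while near-orthogonal slots score close to zero, which is exactly the pressure toward diversity of thought described above.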
AI is notoriously prone to "representation collapse"—a form of mathematical laziness where the model outputs strings of zeros to pass a test without actually learning. GPTCN combats this with Epps-Pulley Signature Regularization. This function "bullies" the AI into intellectual complexity by demanding its thoughts match a Gaussian bell curve. It projects the AI's internal embeddings onto random multi-dimensional spheres and runs harsh statistical tests; if the representation is too simple, the loss function explodes. As the developers describe it, "We’re forcing a calculator to be a philosopher."
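The flavor of this regularizer can be approximated as follows. This is a simplified stand-in, not the exact Epps-Pulley statistic: it projects the batch of embeddings onto random directions (the "random multi-dimensional spheres") and measures how far each projection's empirical characteristic function is from the standard Gaussian's, on a fixed frequency grid. All names and constants are illustrative.

```python
import numpy as np

def normality_penalty(embeddings, n_projections=8, seed=0):
    """Simplified stand-in for Epps-Pulley signature regularization.

    Projects embeddings onto random unit directions, then measures the
    distance between each projection's empirical characteristic function
    and the standard Gaussian's exp(-t^2 / 2). A collapsed representation
    (e.g. all zeros) scores a large penalty. Illustrative only.
    """
    rng = np.random.default_rng(seed)
    n, d = embeddings.shape
    t = np.linspace(-3, 3, 61)               # frequency grid
    gaussian_cf = np.exp(-t ** 2 / 2)
    penalty = 0.0
    for _ in range(n_projections):
        direction = rng.normal(size=d)
        direction /= np.linalg.norm(direction)
        x = embeddings @ direction
        x = (x - x.mean()) / (x.std() + 1e-8)  # standardize the projection
        emp_cf = np.mean(np.exp(1j * np.outer(x, t)), axis=0)
        penalty += np.mean(np.abs(emp_cf - gaussian_cf) ** 2)
    return penalty / n_projections

rng = np.random.default_rng(1)
healthy = rng.normal(size=(512, 64))          # Gaussian-ish embeddings
collapsed = np.zeros((512, 64))               # "lazy" collapsed output
print(normality_penalty(healthy) < normality_penalty(collapsed))  # True
```

The all-zeros output, the classic symptom of representation collapse, produces a flat characteristic function and therefore a large penalty; rich Gaussian-like embeddings pass almost for free.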
This complexity is refined by the Hinge Gate Loss, which acts as a filter for conviction. It ensures that the AI's "gate" (its confidence level) is perfectly aligned with its "quality" (the actual accuracy of its internal model). It prevents the bot from "confidently guessing garbage," forcing a clinical honesty about its own ignorance.
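One plausible shape for such a gate-quality alignment term is a hinge that fires only when stated confidence outruns measured accuracy. The functional form, the margin, and the variable names below are assumptions for illustration, not the GPTCN implementation:

```python
import numpy as np

def hinge_gate_loss(gate, quality, margin=0.1):
    """Hinge-style penalty aligning confidence with competence.

    `gate` is the model's stated confidence in [0, 1]; `quality` is a
    measured accuracy of its internal prediction. The loss is zero while
    confidence stays within `margin` of quality, and grows linearly once
    the model is more confident than it is accurate. The exact form is
    an assumption for illustration.
    """
    return np.maximum(0.0, gate - quality - margin).mean()

honest = hinge_gate_loss(gate=np.array([0.2, 0.9]), quality=np.array([0.3, 0.95]))
bluffing = hinge_gate_loss(gate=np.array([0.9, 0.9]), quality=np.array([0.2, 0.3]))
print(honest, bluffing)  # 0.0 0.55
```

Low confidence on a weak model costs nothing; high confidence on a weak model ("confidently guessing garbage") is exactly what gets penalized.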
The "Time Travel Trap" is the most frequent cause of backtesting failure: a model accidentally "peeks" at future data through symmetrical padding in its code. GPTCN utilizes a Temporal Convolutional Network (TCN) with dilated convolutions, which exponentially expands its field of view to see the immediate tick and the month-long trend simultaneously.
To maintain total causality, the system employs a class called Chomp1d. This is a brute-force "guillotine" for data; it literally amputates the trailing padding from the data arrays, physically removing the space where future data would reside. This ensures the model remains strictly causal. The final output is squashed through a Softsign function, providing non-vanishing gradients that allow the AI to maintain its non-linear responsiveness even during the most extreme market dislocations.
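The pad-then-chomp mechanics can be demonstrated in plain numpy. This is a sketch of the technique, not the GPTCN code: the kernel, dilation, and sequence values are arbitrary, and a framework would implement the convolution itself far more efficiently. The test at the end is the important part: perturbing a future input leaves all earlier outputs untouched.

```python
import numpy as np

def softsign(x):
    # Softsign squashing: x / (1 + |x|); output in (-1, 1),
    # gradients never fully vanish for large |x|.
    return x / (1.0 + np.abs(x))

def causal_dilated_conv(x, kernel, dilation):
    """Dilated 1-D convolution kept strictly causal by 'chomping'.

    Pads the sequence on both sides (as a framework's symmetric padding
    would), then amputates the trailing pad - the numpy analogue of the
    Chomp1d step described in the text - so output[t] only ever sees
    inputs at times <= t.
    """
    pad = (len(kernel) - 1) * dilation
    padded = np.concatenate([np.zeros(pad), x, np.zeros(pad)])
    out = np.zeros(len(x) + pad)
    for t in range(len(out)):
        for i, w in enumerate(kernel):
            out[t] += w * padded[t + i * dilation]
    return softsign(out[:len(x)])        # chomp: drop the trailing pad steps

rng = np.random.default_rng(0)
x = rng.normal(size=32)
kernel = np.array([0.5, -0.25, 0.1])

y = causal_dilated_conv(x, kernel, dilation=4)
x_future = x.copy()
x_future[20] += 10.0                      # inject a shock at a "future" step
y_future = causal_dilated_conv(x_future, kernel, dilation=4)
print(np.allclose(y[:20], y_future[:20]))  # True: the past is unchanged
```

Without the final slice, outputs near the end would mix in the trailing zeros that sit where future data belongs, which is precisely the "time travel" leak the Chomp step removes.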
The final stage of the assembly line is the Hybrid Reward function, which designs an artificial "personality" for the trader by balancing a set of competing objectives. This "Synthetic Psychology" is enforced through two critical parameters. The result is an algorithm with the mathematical equivalent of courage and conviction.
The GPTCN operates as a disciplined assembly line: raw data is refined into frequency frames, analyzed by the JEPA specialists, translated into action by the causal TCN, and audited by the reward loop.
However, this sophistication leads to a profound philosophical paradox. By using tools like Epps-Pulley to mandate that the AI finds complex Gaussian structures in a market that may be fundamentally chaotic, are we discovering order or imposing it? If the market is a casino, the GPTCN is a scientist looking for the "physics" of the dice. But if enough capital begins to trade based on these "hallucinations" of complexity, the market itself may eventually change to match the AI’s expectations. We are left to wonder if we are solving the chaos, or simply creating a self-fulfilling prophecy where the line between finding a trend and creating one finally disappears.
Selected projects (repos TBD):

- Pipeline using PCA + rolling ICA to isolate latent return components from OHLCV data.
- Transformer encodes volatility & price → initializes LSTM → predicts mean & deviation.
- Custom fuzzy layer built on clustered latent embeddings for interpretable decision-making.

I bring something rare: the ability to merge research-level AI with enterprise-scale software engineering.