Kyle (1985): Strategic Trading and Price Impact

glosten-milgrom-model shows that the spread exists because of adverse selection. But it treats informed traders as mechanical — they simply buy when is high and sell when low. Kyle (1985) asks the next question: what if the informed trader is strategic?

The answer produces the single most important parameter in execution analysis: , the price impact coefficient.

The Three Agents

Kyle’s model has exactly three participants:

  1. The insider: knows the true asset value with certainty. Submits a market order of size .
  2. Noise traders: submit random aggregate order flow . They trade for liquidity reasons unrelated to .
  3. The market maker: observes total order flow but cannot distinguish insider flow from noise. Sets price to break even in expectation.

The true value is , independent of .

The Linear Equilibrium

Kyle shows there exists a unique linear equilibrium with two components:

The insider’s optimal strategy

The insider trades proportionally to their information advantage . The coefficient governs how aggressively they trade.

The market maker’s pricing rule

The price is the prior mean plus a linear adjustment for observed order flow. The coefficient is the price impact — how much the price moves per unit of net order flow.

Solving for and

The market maker sets . Since and are jointly normal and :

Computing each term:

Therefore:

The insider maximizes expected profit :

First-order condition in :

Substituting back and solving the system of two equations:

Interpreting

The price impact has clean comparative statics:

  • increases with : more uncertainty about the true value means each unit of order flow is more informative. The maker must adjust prices more aggressively.
  • decreases with : more noise trading provides cover for the insider. The maker cannot distinguish signal from noise, so each unit of flow moves the price less.

The ratio is the signal-to-noise ratio of order flow. High signal-to-noise means high price impact.

Key Results

1. The insider trades gradually

With , the insider scales their trade to the noise level. More noise means larger trades (more cover). The insider never reveals all information at once — they optimally spread their trades to avoid moving the price too much.

2. Half the information is incorporated

In this single-period model, the price after trading incorporates exactly half of the insider’s private information:

The remaining half is “saved” — in the multi-period version, the insider spreads information incorporation across all trading rounds.

3. Noise traders are essential

Without noise (), and the market collapses. The maker would perfectly infer from any trade, so the insider cannot profit, so they do not trade, so there is no price discovery. Noise traders are not parasites — they are the necessary substrate for markets to function.

4. Insider profit

Profit increases with both (more information to exploit) and (more cover to exploit it behind).

From Kyle to Execution Algorithms

Kyle’s is the foundation of modern execution cost analysis. When an institution needs to sell 1M shares, the execution desk faces exactly the Kyle problem: their order will move the price, and the price impact is roughly linear in order size (for moderate sizes).

The VWAP, TWAP, and implementation shortfall algorithms all attempt to minimize -mediated impact by:

  • Splitting orders across time (spreading like Kyle’s insider)
  • Trading during high-volume periods (when is large)
  • Randomizing timing (to look like noise)

Almgren-Chriss (2001) formalized this as an optimal control problem: trade off the urgency cost (risk of price drift while waiting) against the impact cost ( per unit traded).

Connection to AMMs: Convex vs. Linear Price Impact

In Kyle’s model, price impact is linear: . In a constant-product AMM, price impact is convex: the marginal price moves faster as trade size increases relative to pool reserves.

For a pool with reserves and constant product , the effective price impact of buying units is:

This convexity means:

  • Small trades experience approximately linear impact (resembling Kyle)
  • Large trades face rapidly increasing impact — a natural defense against informed trading, but also a cost for legitimate large liquidity needs
  • Unlike Kyle’s maker, the AMM cannot modulate — it is hardcoded into the bonding curve

The implication: AMMs are too generous to small informed trades (low impact) and too punitive to large uninformed trades (high impact), compared to what a Kyle-rational market maker would offer.


Companion notebook: notebooks/market-microstructure/05-kyle-lambda.py — simulate the Kyle equilibrium, estimate from order flow data, compare linear vs. convex price impact schedules.

Questions to sit with:

  1. Kyle assumes the insider knows with certainty. What changes if the insider has a noisy signal instead? Does increase or decrease?
  2. In crypto markets, on-chain order flow is visible before execution (mempool). How does this change the insider’s optimal strategy?
  3. If you were designing an AMM, how would you introduce an adaptive that responds to estimated informed trading fraction?