Skip to content

7.2.4 LLM Industry Landscape

LLM industry landscape layer map

After completing this section, you will be able to:

  • Understand the LLM ecosystem from an industry-chain perspective
  • Distinguish what the model layer, platform layer, and application layer each do
  • Understand the different strengths of open-source and closed-source approaches
  • Practice model selection thinking with a small example

Model layer: who is building the “engine”?

Section titled “Model layer: who is building the “engine”?”

The model layer is mainly responsible for training foundation models and general-purpose models. You can think of it as “the people who build the engine.”

This layer usually focuses on:

  • Model architecture
  • Training data
  • Training compute
  • Model capabilities

Common forms include:

  • Closed-source API models
  • Open-source downloadable models
  • Industry-specific models

Platform layer: who makes models easier to use?

Section titled “Platform layer: who makes models easier to use?”

The platform layer is like “the people who build roads and supply electricity.”

What they usually do includes:

  • Model hosting
  • Inference services
  • Vector databases
  • Monitoring and evaluation
  • Fine-tuning platforms
  • Agent / workflow development frameworks

Without the platform layer, many teams would struggle to put models into production reliably, even if they already had the model.

Beginner glossary: words that appear often in the platform layer

Section titled “Beginner glossary: words that appear often in the platform layer”
TermWhat it meansWhy it matters
APIA standard interface for calling a model or serviceLets your application request model results without managing the model internals
InferenceRunning the model to produce an outputThis is what happens every time a user asks a question
Vector databaseA database optimized for storing and searching embeddingsIt is often used as the retrieval layer in RAG systems
MonitoringObserving latency, errors, cost, and output quality over timeProduction systems need this to detect problems early
EvaluationMeasuring whether outputs meet task requirementsIt prevents teams from judging models only by feeling

Application layer: the layer closest to users

Section titled “Application layer: the layer closest to users”

The application layer sells results, not models

Section titled “The application layer sells results, not models”

The application layer is more like “the people who run a restaurant.”

Users usually do not care which attention mechanism you use. They care about:

  • Whether it can help me complete the task
  • Whether the answer is reliable
  • Whether it is fast enough
  • Whether the cost is acceptable

Typical applications include:

  • AI search
  • AI customer service
  • AI coding assistants
  • AI office tools
  • AI teaching assistants

The same model can become many different products

Section titled “The same model can become many different products”

The same foundation model may turn into completely different products in different teams:

  • Legal assistant
  • Sales assistant
  • Education assistant
  • Code review tool

This shows that industry competition is not only about “whose model is bigger.” It also happens in:

  • Workflow design
  • Data accumulation
  • Product experience
  • Industry know-how

How should you choose between open-source and closed-source approaches?

Section titled “How should you choose between open-source and closed-source approaches?”

Closed-source models are more like “plug-and-play mature engines”

Section titled “Closed-source models are more like “plug-and-play mature engines””

Their advantages are usually:

  • Strong out-of-the-box performance
  • Less model maintenance work
  • Faster time to launch

Their trade-offs are usually:

  • Cost is charged per API call
  • Less controllable
  • Private deployment is more limited

Open-source models are more like “engines you can modify yourself”

Section titled “Open-source models are more like “engines you can modify yourself””

Their advantages are usually:

  • Can be self-hosted
  • Can be fine-tuned
  • More control over data and inference pipeline

Their trade-offs are usually:

  • Deployment and maintenance are more complex
  • Performance is not always naturally the strongest
  • Requires more engineering capability

One-sentence memory aid:

Closed-source is easier to use, open-source is more controllable.


What many teams really compete on is “system capability”

Section titled “What many teams really compete on is “system capability””

The model is only one component in the system

Section titled “The model is only one component in the system”

In real-world LLM products, it is usually not “the model acting alone,” but the collaboration of an entire system:

  • Prompt
  • RAG
  • Tool calling
  • Evaluation framework
  • Safety strategy
  • Cost control

In other words:

User experience = model capability × system design × data quality

Why can the same model feel very different in different products?

Section titled “Why can the same model feel very different in different products?”

Because what really determines the experience often also includes:

  • How good the knowledge base is
  • How accurate the tools are
  • Whether there is a fallback when things fail
  • How well latency is controlled

This is also why “being able to call an API” is not the same as “being able to build an AI product.”


Do not ask first, “Which one is the strongest?” Ask first, “What do I need?”

Section titled “Do not ask first, “Which one is the strongest?” Ask first, “What do I need?””

Common selection dimensions include:

DimensionQuestion to ask
QualityIs the task performance good enough?
CostIs each call expensive?
LatencyCan users accept the response speed?
ControllabilityCan it be privately deployed, fine-tuned, and audited?
MultimodalDo you need image understanding or audio?
Tool capabilityDo you need function calling / agent support?

This example is not about choosing a real latest model. It is for practicing how to score based on requirements.

models = {
"cloud_api_model": {
"quality": 9,
"cost": 4,
"latency": 8,
"control": 4
},
"open_source_8b": {
"quality": 6,
"cost": 9,
"latency": 7,
"control": 9
},
"open_source_70b": {
"quality": 8,
"cost": 5,
"latency": 5,
"control": 9
}
}
weights = {
"quality": 0.4,
"cost": 0.2,
"latency": 0.2,
"control": 0.2
}
def score_model(info, weights):
return sum(info[k] * weights[k] for k in weights)
scores = []
for name, info in models.items():
scores.append((score_model(info, weights), name))
for score, name in sorted(scores, reverse=True):
print(name, "->", round(score, 2))

Expected output:

Terminal window
open_source_8b -> 7.4
open_source_70b -> 7.0
cloud_api_model -> 6.8

Model-selection scoring result map

You can change weights to simulate different preferences in different companies.


Why does the “industry landscape” matter to engineers too?

Section titled “Why does the “industry landscape” matter to engineers too?”

Because you make technical choices every day

Section titled “Because you make technical choices every day”

You will keep running into questions like:

  • Use an API or self-host it?
  • Start with RAG or start with fine-tuning?
  • Use a general-purpose model or a vertical model?
  • Use a single model or multi-model routing?

At their core, these questions are all related to industry structure.

Because technology choices affect your career path

Section titled “Because technology choices affect your career path”

Different roles tend to emphasize different skills:

  • Foundation models: more focused on training and algorithms
  • Platform engineering: more focused on inference, deployment, and optimization
  • Application engineering: more focused on product, workflows, and evaluation

Understanding the industry landscape can help you more clearly see which kind of role you want to move toward.


Keep this page’s proof of learning as a small evidence card:

Model Choice
closed, open, hosted, or self-deployed option
Decision Factors
quality, latency, privacy, cost, ecosystem
System Capability
model + data + product + eval + ops
Risk Note
vendor lock-in, compliance, or deployment constraint
Evidence
one model-selection table for a real use case

Leaderboards are useful, but they are not everything. In real projects, cost, latency, and stability are equally important.

A model being open-source does not mean training, deployment, and maintenance are all cheap.

Thinking there is always a “best model”

Section titled “Thinking there is always a “best model””

Many times there is no “absolute best,” only “the most suitable for the current scenario.”


The most important idea in this section is:

The LLM industry is not just a competition of model parameters. It is a competition of the combination of models, platforms, data, products, and engineering capability.

For application builders, understanding the industry landscape is not about following trends. It is about making more stable technical and product decisions.


  1. Change the weights in the scoring script to simulate the model-selection preferences of a “startup team” and a “financial enterprise.”
  2. Think about this: if your project requires private deployment, how would the priority between open-source and closed-source approaches change?
  3. Explain in your own words: why is the real competitive advantage often not just the model itself?
Project reference and review notes
  1. A startup team may weight speed, cost, API simplicity, and iteration velocity higher. A financial enterprise should weight privacy, compliance, auditability, reliability, and vendor controls higher.
  2. Private deployment usually raises the priority of open-weight models, self-hosting, private cloud options, and data-governance controls. Closed-source APIs may still be useful if they satisfy isolation and compliance needs.
  3. Advantage often comes from proprietary data, workflow integration, evaluation loops, user trust, distribution, and operations. The model is important, but it is only one layer of the product system.