What are model weights?

Model weights are the secret sauce of learning.
Yann LeCun

How It Works:

Weights are numerical parameters inside a neural network that adjust during training to minimize prediction errors and encode learned patterns.

Key Benefits:

  • Expressivity: Captures relationships in data.
  • Transferable knowledge: Fine-tune pretrained weights on new tasks.
  • Compact representation: Thousands to billions of parameters.

Real-World Use Cases:

  • Transfer learning: Use ImageNet weights for new vision tasks.
  • Speech models: Warm-start from pretrained speech recognition weights.

FAQs

Can weights be shared?
Are bigger weights better?