Q-Learning
A model-free reinforcement learning algorithm to learn the value of an action in a particular state.
Description
Q-Learning is a model-free reinforcement learning algorithm used to find an optimal action-selection policy for any given finite Markov decision process. It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. When such an action-value function is learned, the optimal policy can be constructed by simply selecting the action with the highest value in each state.
Examples
- 🎮 Game playing AI
- 🤖 Robot navigation
- 📊 Resource management
Applications
Related Terms
Featured

Kimi AI
Kimi AI - K2 chatbot for long-context coding and research

Abacus AI
The World's First Super Assistant for Professionals and Enterprises

Higgsfield AI
Cinematic AI video generator with pro VFX control

Tidio
Smart, human-like support powered by AI — available 24/7.

AI PDF Assistant
AI PDF Assistant is an intelligent recommendation tool

Animon AI
Create anime videos for free

ChatGPT Atlas
The browser with ChatGPT built in

Sora 2
Transform Ideas into Stunning Videos with Sora 2

Blackbox AI
Accelerate development with Blackbox AI's multi-model platform

