Q-Learning
A model-free reinforcement learning algorithm to learn the value of an action in a particular state.
Description
Q-Learning is a model-free reinforcement learning algorithm used to find an optimal action-selection policy for any given finite Markov decision process. It works by learning an action-value function that ultimately gives the expected utility of taking a given action in a given state and following the optimal policy thereafter. When such an action-value function is learned, the optimal policy can be constructed by simply selecting the action with the highest value in each state.
Examples
- 🎮 Game playing AI
- 🤖 Robot navigation
- 📊 Resource management
Applications
Related Terms
Featured

Wondershare Repairit
AI-powered data repair for videos, photos, audio, and files in minutes.

Zawa
AI Branding Design Agent

Lyro
AI support that feels human

Lium
AI for Complex Data

Wondershare Filmora
Edit as an Expert with Filmora AI

Wondershare Recoverit AI Data Recovery
AI recovery, AI data recovery, AI video recovery, AI video repair, AI photo recovery, AI photo repair

RemoveSynthID
Reduce invisible SynthID signals while keeping images clear and private.

CoSupport AI
AI-powered platform for automating customer support

AI Influencer Generator
Sceneform.ai is an AI platform for creating realistic virtual influencers, UGC ads, talking avatars, and short-form social videos at scale.

