Anuket Project

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 2 Next »


Volunteers


NameML Category
Jahanvi                                  Supervised                                        
AkankshaUnsupervised
Kanak RajReinforced

Supervised

Algorithms

NameComments on ApplicabilityReference









Un-supervised

Algorithms

NameComments on ApplicabilityReference









Reinforcement Learning

  1. Active Learning
  2. No labeled data
  3. Can afford to make mistakes?
  4. Is it possible to use a simulated environment for the task?
  5. Lots of time
  6. Think about the variables that can define the state of the environment.
    1. State Variables and Quantify them
    2. The agent has access to these variables at every time step
    3. Concrete Reward Function and Compute Reward after action
    4. Define Policy Function

Algorithms

NameComments on ApplicabilityReference

Q Learning












  • No labels