Skip to end of metadata
Go to start of metadata
You are viewing an old version of this page. View the current version.
Compare with Current
View Page History
« Previous
Version 2
Next »
Volunteers
Name | ML Category |
---|
Jahanvi | Supervised |
Akanksha | Unsupervised |
Kanak Raj | Reinforced |
Supervised
Algorithms
Name | Comments on Applicability | Reference |
---|
|
|
|
|
|
|
|
|
|
Un-supervised
Algorithms
Name | Comments on Applicability | Reference |
---|
|
|
|
|
|
|
|
|
|
Reinforcement Learning
- Active Learning
- No labeled data
- Can afford to make mistakes?
- Is it possible to use a simulated environment for the task?
- Lots of time
- Think about the variables that can define the state of the environment.
- State Variables and Quantify them
- The agent has access to these variables at every time step
- Concrete Reward Function and Compute Reward after action
- Define Policy Function
Algorithms
Name | Comments on Applicability | Reference |
---|
Q Learning |
|
|
|
|
|
|
|
|