Volunteers
Name | ML Category |
---|---|
Jahanvi | Supervised |
Akanksha | Unsupervised |
Kanak Raj | Reinforcement |
...
Unsupervised Learning
- Clustering - hierarchical clustering, k-means, mixture models, DBSCAN, and OPTICS
- Anomaly Detection - Local Outlier Factor and Isolation Forest
- Dimensionality Reduction - principal component analysis, independent component analysis, non-negative matrix factorization, and singular value decomposition
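As a quick illustration of the clustering bullet above, here is a minimal k-means (Lloyd's algorithm) sketch in plain NumPy. The two-blob toy data, the seed values, and the iteration count are all invented for the example; a library implementation such as scikit-learn's `KMeans` would normally be used instead.

```python
import numpy as np

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's algorithm: alternate nearest-centroid assignment
    and centroid recomputation."""
    rng = np.random.default_rng(seed)
    # initialize centroids by picking k distinct data points
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        # distances of every point to every centroid, shape (n, k)
        dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):           # guard against empty clusters
                centroids[j] = points[labels == j].mean(axis=0)
    return labels, centroids

# Two well-separated toy blobs around (0, 0) and (5, 5).
rng = np.random.default_rng(1)
blob_a = rng.normal(0.0, 0.5, size=(20, 2))
blob_b = rng.normal(5.0, 0.5, size=(20, 2))
labels, centroids = kmeans(np.vstack([blob_a, blob_b]), k=2)
```

With blobs this far apart, the two learned labels should separate them cleanly; on real data, results depend on initialization and the choice of k.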
...
Name | Comments on Applicability | Reference |
---|---|---|
Hierarchical Clustering | | |
k-means | | |
Gaussian Mixture Models | | |
Reinforcement Learning
- Active learning
- No labeled data
- No supervisor, only a reward signal
- Actions are sequential
- Feedback is delayed, not instantaneous
- Can you afford to make mistakes?
- Is it possible to use a simulated environment for the task?
- Training takes lots of time
- Think about the variables that can define the state of the environment:
  - Identify the state variables and quantify them
  - The agent has access to these variables at every time step
  - Define a concrete reward function and compute the reward after each action
  - Define the policy function
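The steps above (state variables, reward function, policy function) can be sketched with tabular Q-learning on a toy problem. The chain environment, the +1 goal reward, and every hyperparameter here are invented for illustration, not taken from the notes.

```python
import random

# Toy deterministic chain: states 0..4, actions 0 = left, 1 = right.
# Reaching state 4 pays +1 and ends the episode; every other step pays 0,
# so the reward feedback is delayed, as the checklist describes.
N_STATES, N_ACTIONS, GOAL = 5, 2, 4

def step(state, action):
    """Environment transition: returns (next_state, reward, done)."""
    nxt = min(state + 1, GOAL) if action == 1 else max(state - 1, 0)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL

def greedy(q_row):
    """Pick the best-valued action, breaking ties at random."""
    best = max(q_row)
    return random.choice([a for a, v in enumerate(q_row) if v == best])

def train(episodes=300, alpha=0.5, gamma=0.9, eps=0.1, seed=0):
    random.seed(seed)
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]   # Q-table: state x action
    for _ in range(episodes):
        s, done = 0, False
        while not done:
            # epsilon-greedy policy: explore with probability eps, else exploit
            a = random.randrange(N_ACTIONS) if random.random() < eps else greedy(q[s])
            s2, r, done = step(s, a)
            # Q-learning update: bootstrap off the best next-state value
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q

q = train()
policy = [greedy(q[s]) for s in range(N_STATES)]   # learned greedy policy
```

The state is a single quantified variable (the position), the reward function is computed after each action inside `step`, and the policy is derived from the learned Q-table, mirroring the listed steps.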
...
Name | Comments on Applicability | Reference |
---|---|---|
Q-Learning | | |
Is RL Possible?
- Do you have very high computational power?
- Do you have lots of time to train an agent?
- Do you need your model to be self-explanatory, i.e., should humans be able to understand the reasoning behind its predictions and decisions?
- Do you need your model to be easy to implement and maintain?
- Is it possible to try the problem several times and afford to make many mistakes?
- In your situation, is active and online learning possible, i.e., can the algorithm explore new regions of the data space while learning through its actions, and then learn from those new conditions and data?
- In your situation, can the algorithm take sequential actions to complete the task?
- Is it possible to define a policy function, i.e., the actions the agent takes as a function of its own state and the environment?
- Is it possible to define a function that gives feedback on actions, such that the feedback helps the agent learn and choose its next action?
- Can you simulate an environment for the task, so that the algorithm can try many times and learn from its mistakes?
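When the answer to the simulation question is yes, the environment can be as small as a class with `reset`/`step` methods (an interface modeled loosely on OpenAI Gym). The `LineWorld` task, its size, and its reward scheme below are hypothetical, invented only to show that "trying many times and making mistakes" is cheap in simulation.

```python
import random

class LineWorld:
    """Toy simulated environment: the agent starts at position 0 and
    must reach position size - 1, where it receives reward +1."""

    def __init__(self, size=5):
        self.size = size
        self.pos = 0

    def reset(self):
        """Start a new episode and return the initial state."""
        self.pos = 0
        return self.pos

    def step(self, action):
        """Apply an action (0 = left, 1 = right) and return
        (next_state, reward, done) as feedback for the agent."""
        if action == 1:
            self.pos = min(self.pos + 1, self.size - 1)
        else:
            self.pos = max(self.pos - 1, 0)
        done = self.pos == self.size - 1
        return self.pos, (1.0 if done else 0.0), done

# Trying the task many times with a random policy: mistakes cost nothing here.
random.seed(0)
env = LineWorld()
episode_lengths = []
for _ in range(10):
    state, done, steps = env.reset(), False, 0
    while not done:
        state, reward, done = env.step(random.randrange(2))
        steps += 1
    episode_lengths.append(steps)
```

Each episode records how many random steps were needed to reach the goal; a learning agent plugged into the same `reset`/`step` loop would use the returned reward to improve over these random baselines.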