...
- Survey is completed.
- Testbed is assigned - Pod12-Jump
- Framework : Acumos (too many issues).
- Problem Domain - Failure Prediction
- Clear Definition of Failure Prediction - Ongoing.
- Existing Models with FP - ARIMA or RNN - Used to deploy and test.
- Enhancement to Existing works on FP - Not yet started
- Data Gathering: (Important*)
- Publicly Available: Searching...
- Collecting from existing testbeds: WIP
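
One way to make the failure prediction definition concrete, as a minimal sketch only: forecast CPU consumption a few steps ahead and flag a predicted failure when the forecast crosses a threshold. The threshold rule, the function names, and the sample values are assumptions for illustration and are not the team's agreed definition. The model is a plain AR(1) fit by least squares (equivalent in form to ARIMA with p=1, d=0, q=0), used here as a dependency-free stand-in for the ARIMA/RNN models mentioned above.

```python
from statistics import mean


def fit_ar1(series):
    """Least-squares fit of x[t] = c + phi * x[t-1]; returns (c, phi)."""
    x_prev, x_next = series[:-1], series[1:]
    mean_prev, mean_next = mean(x_prev), mean(x_next)
    var = sum((a - mean_prev) ** 2 for a in x_prev)
    if var == 0:
        # Constant history: fall back to forecasting the mean.
        return mean_next, 0.0
    cov = sum((a - mean_prev) * (b - mean_next) for a, b in zip(x_prev, x_next))
    phi = cov / var
    return mean_next - phi * mean_prev, phi


def forecast(series, steps):
    """Roll the fitted AR(1) model forward `steps` samples."""
    c, phi = fit_ar1(series)
    out, last = [], series[-1]
    for _ in range(steps):
        last = c + phi * last
        out.append(last)
    return out


def predicts_failure(series, steps, threshold):
    """Assumed rule: a failure is predicted if any forecast value reaches the threshold."""
    return any(value >= threshold for value in forecast(series, steps))


# Hypothetical CPU consumption samples (percent), not measured data.
cpu = [10, 20, 30, 40, 50]
print(forecast(cpu, 3))
print(predicts_failure(cpu, steps=3, threshold=75))
```

A real ARIMA or RNN model would replace `fit_ar1` and `forecast`; the threshold check would stay the same under this assumed definition.
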
Sl. No. | Topic | Presenter | Notes |
---|---|---|---|
1 | Framework Deployment Status | Acumos - Container/K8S based approach. Vanilla deployment - Failure to deploy for both approaches (with and without cluster deployment). | |
2 | Survey - Implementation details - Status | Completed https://docs.google.com/spreadsheets/d/15XRdrWvbSCPsg1zZ9PfT9yvnElq21AvB/edit#gid=971676644 | |
3 | Model Deployment Status | Waiting for the Framework to be UP - to run on the testbed. Currently running locally - Google Colab (Jupyter Notebooks). Data: CPU consumption. Failure: VM. | |
4 | Publicly Available Data | To be added by Girish/Rohit | |
5 | Failure Prediction Definition - Status | Existing works: <br>Gaps: <br>How to collect data: take advantage of chaos engineering projects - Litmus, Pumba, Blockade, etc. | |
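
If failures are injected with a chaos engineering tool, the injection times can serve as labels for the collected CPU data. The sketch below shows one possible labelling scheme: each sliding window of samples is marked 1 if a failure occurs within a fixed horizon after the window ends. The function name, the window and horizon parameters, and the sample values are all hypothetical; nothing here reflects an API of Litmus, Pumba, or Blockade.

```python
def label_windows(samples, failure_times, window, horizon):
    """Build (features, label) rows from time-ordered (timestamp, cpu) samples.

    label is 1 when any failure time falls in (t_end, t_end + horizon],
    where t_end is the timestamp of the last sample in the window.
    """
    rows = []
    for i in range(window, len(samples) + 1):
        chunk = samples[i - window:i]
        t_end = chunk[-1][0]
        label = int(any(t_end < f <= t_end + horizon for f in failure_times))
        rows.append(([cpu for _, cpu in chunk], label))
    return rows


# Hypothetical data: (timestamp, CPU percent) and one injected VM failure at t=5.
samples = [(0, 10), (1, 12), (2, 50), (3, 90), (4, 95)]
rows = label_windows(samples, failure_times=[5], window=2, horizon=2)
for features, label in rows:
    print(features, label)
```

The resulting rows could feed either a classifier or the evaluation of a forecasting model; the horizon controls how far ahead a prediction is expected to be useful.
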