- Use Chaos tools to generate Data - Infrastructure Data and Events.Enlist the operations that are
- Time-based, configuration-changing, stress-ng. Input - Table of
- There are many 'Chaos-Tools', such as litmus. We want to use this tool to create chaos and generate some data (infrastructure metrics). Then we can use this data and corresponding Chaos - for some failure prediction works.
- The main purpose is to solve the problem of 'Lack of Dataset'.
- This is an experimental work - we are not sure if we can 'create a usable dataset'.
- Time-Varying, Load-Varying tool to emulate cnf/vnf. (Ongoing - In collaboration with ViNePerf)
- Enhance stress-ng to 'emulate a vnf/cnf' - in terms of resource (cpu, memory, storage & N/w).
- Enhance to check if the 'configured' load is same the 'actual' load. If there is a difference, then the cluster cannot support such CNF/VNF. Ex: If we configure memory I/O to be 10000 iops, but we achieve only 8000 iops, then the cluster is not suitable for that CNF (which requires 10000 iops).
- We also define what 'failure of cnf' means using these differences (actual vs configured).
- This enhanced stress-ng also has other usecases.
- Build a custom VM/Container which tries its best to fail - based on 2 and 3. (Ongoing - In Collaboration with ViNePerf)
- This can act as 'Noisy-Neighbor' - test the impact of this noisy-neighbor on dataplane performance.
- Enlist the operations (valid) that can be done from VM/Container, and that can make the VM/Container fail.
- Study to understand what (kind of operations) can cause failures of CNF/VNF.
- Once we have that information, we can emulate it.
- Simplify deployment of Acumos
- Acumos is a LF's AI/ML framework.
- Just started with Kubeflow Vs Acumos comparative study - ( 3 students are working on this project )
- Goals: Dynamically plug&play different models, integrate with any data-source.
- As of now, no framework is used - we are running with jupyter notebooks, with tensorflow libraries.
- Emulating Monitoring data and events - GANs? Build a custom VM/Container which tries its best to fail - based on 2 and 3.(Started)
- Synthetic data (metrics and logs - Timeseries. Collectd-metrics, system logs, etc.) generation using GANs.
- Currently the performance poor, we want to take this problem to ITU.
- Data Extraction utility - Given a data model, the tools should extract the required data from the large amount of data. (Ongoing - Sridhar)
- A tool to extract data from different sources.
- Currently - prometheus is done, working on Elasticsearch. Python APIs to extract the data-source.
- This is ONLY required if the framework is not integrated to these data sources.
- Input: Time begin and end, and filters. Filters can be used 'exclude' some columns of the data - Hence data model plays importance. This also helps to achieve 'anonymization' of data.
- ModelSelector - Wizard tool that will ask user about the problem and data, and will suggest which ML technique to use (within Supervised, Unsupervised, and reinforced). (Ongoing - Sridhar, Kanak, Akanksha)
- Completed.
- https://github.com/opnfv/thoth/tree/master/tools/modelselector
- Not very well tested!
- Algorithm is just recommended.
- This work will taken forward to MaaS.
- MaaS: Minimal Questions will be asked, dataset is taken, and try out with different models, and best performing is chosen and provided to the user.
- Minimal: Questions about the problem and not the dataset.
- We plan to use Kubeflow/Acumos here. Each and every model as a separate container, and we can do plug and play,
- MaaS: Minimal Questions will be asked, dataset is taken, and try out with different models, and best performing is chosen and provided to the user.
- Openstack-LogAnalysis with NLP (Started)
- Ongoing using Google's BERT.
- Currently using China-Unicom's Dataset.
Not Started: 1, 3 and 4.
Ongoing: 2 (Kuldip Yadav ), 5 (renukananda td ), 6 (Sridhar Rao ), 8 (Rohit Singh Rathaur)
Completed: 7
Underlined: Student Volunteers already started working on these.
Meetings where some these projects are elaborated: