Challenge From Huawei

We are looking for talents to solve the challenge of lifelong learning together! For professors in universities, we may have collaboration projects funded by Huawei to work on this problem. For students who are interested in this topic, welcome to join us as interns or employees.

Background

According to a market report, the global edge intelligence market is expected to reach US$3.093 billion by 2023, with a compound annual growth rate (CAGR) of 28.8%. Edge intelligence has potential applications in various industries, such as transportation, industry, energy, aerospace, media, and farming. However, one of the main challenges for monetizing its business value is to achieve large-scale replication. The current obstacles to replicating edge intelligence include small and heterogeneous data sets on edges (see figures below), as well as the users’ requirements for data security and offline autonomy of edge intelligence services.

Technical Challenges

  • Catastrophic forgetting: Single-task methods, such as isolated learning, single-task incremental learning, and single-task federated learning, build a monolithic model in the cloud for inference on edges. However, these methods suffer from catastrophic forgetting [2], which means that the historical model is overwritten by the updated model and loses its previous knowledge. Moreover, it is challenging to use different models to solve different problems with these methods.
  • AI engineering and automation: Multi-task methods, such as multi-task learning, multi-task incremental learning, and multi-task federated learning, manually define multiple models to adapt to different edges. However, these methods require considerable human and computational resources for building and maintaining such models. AI engineering and automation are still far from mature.
  • Identification and processing of unknown tasks: Edge environments are complex and diverse, and the closed-world assumption —that all tasks in the inference stage are perfectly learned in the training stage —does not hold in edge scenarios. Therefore, it is necessary to assume that unknown tasks may exist during the running of edge models and to verify and update them in a timely manner to ensure the robustness and reliability of related services.
  • Research Topics and Benchmarks

    We conducted research on the core technologies and system optimization of edge-cloud collaborative lifelong learning based on the Sedna and Ianvs frameworks, achieving multi-model full lifecycle management. System Goals and Core Technologies:

  • Maximize the unknown task generalization metric(FWT), and the anti-forgetting metric(BWT) must reach 95%
  • The development time of new scenarios must be less than two weeks. Data cannot be transferred out of the edges, and offline autonomy of services must be implemented.
  • Models must be decoupled so that models are pluggable in complex and changeable edge scenarios.
  • Partition latent tasks in a given dataset, and match highest-similarity tasks for inference samples. The task matching accuracy must reach 95%.
  • Unknown and known tasks are distinguished in inference. The identification accuracy of unknown tasks must reach 99%.
  • Optimize models in the knowledge base with the help of data and models from different resources, such as mechanism models, task (transfer) relation discovery, simulation data, and large-scale pre-trained models. The anti-forgetting (BWT) and generalization (FWT) metrics must reach 95%.
  • We offer Cloud-Robotics, a real-world lifelong learning dataset for semantic segmentation in robotics. Please contact Dr. Zheng Zimu(zimu.zheng@huawei.com) for more details.