The annual Computer Vision and Pattern Recognition Conference (CVPR) ended in New Orleans. Leading technology company OPPO had seven papers accepted, making it one of the most successful at the event. OPPO won three first-place, one second-place, and four third-place prizes in conference competitions.
“In 2012, deep neural networks designed for image recognition rejuvenated the research and application of artificial intelligence. Ever since AI technology has seen a decade of rapid development,” said Guo Yandong, Chief Scientist in Intelligent Perception at OPPO.
“OPPO continues to promote artificial intelligence to accomplish complex perceptual and cognitive behaviors. We empower AI with higher cognitive abilities to understand and create beauty and develop embodied AI with autonomous behavior. I’m delighted to see that seven of our papers have been selected for this year’s conference. Building on this success, we will continue to explore both fundamental AI and cutting-edge AI technology, as well as the commercial applications that will enable us to bring the benefits of AI to more people.”
The Seven papers accepted by CVPR 2022 showcase OPPO‘s progress in creating humanizing AI
Seven papers submitted by OPPO for CVPR 2022 were selected for presentation at the conference. Their areas of research include multimodal information interaction, 3D human body reconstruction, personalized image aesthetics assessment, knowledge distillation, and others.
Cross-modular innovation is viewed as the way to ‘humanize’ artificial intelligence. Text data frequently includes an elevated degree of over-simplification, while visual picture data contains a lot of specific contextual details. OPPO researchers proposed a new CRIS framework based on the CLIP model to enable AI to get a more fine-grained understanding of the text and image modal data.
The biggest difference between human and artificial intelligence today lies in multimodality. Humans can undoubtedly figure out data in both words and pictures and draw relationship between the two sorts of data. The novel method proposed by OPPO improves multimodal intelligence, which could potentially lead to artificial intelligence being able to truly understand and interpret the world through multiple forms of information such as language, hearing, vision, and others, making the robot and digital assistants of sci-fi movies become a reality.
CRIS: CLIP-Driven Referring Image Segmentation
OPPO Research Institute has made progress in 3D human body recreating. OPPO revealed a technology at CVPR for autonomously creating digital avatars with natural attire. The OPPO model can reliably build 3D, 1:1 dynamic models with minor details like logos and fabric textures by analyzing RGB footage of individuals. Creating accurate 3D clothing models is difficult. The new model minimizes the needs for 3D human body reconstruction, giving technical foundations for virtual changing rooms, AI fitness training, and lifelike avatars in VR/AR worlds.
Structured Local Radiance Fields for Human Avatar Modeling
AI image recognition has now reached a stage where it can accurately identify a wide range of objects within an image. The ability of AI to evaluate images in terms of their perceived aesthetic quality is often strongly related to the big data used in training the AI model.
In collaboration with Leida Li, a professor from Xidian University proposed Personalized Image Aesthetics Assessment (PIAA) model. The model is the first to optimize AI aesthetics assessment by combining users’ subjective preferences with more generalized aesthetic values. In the future, the model will be used to create personalized experiences for users, not just limited to the curation of photo albums, but also provide recommendations on how to shoot the best photo and which content a user might prefer.
Personalized Image Aesthetics Assessment with Rich Attributes
OPPO has also chosen to make the PIAA model evaluation data set the open-source for developers, with a number of research institutions and universities already expressing an interest in using the data to further their own efforts in personalized AI aesthetic assessment.
OPPO also proposed a multi-view 3D semantic plane reconstruction approach for understanding 3D surfaces. INS-Conv (INcremental Sparse Convolution) was developed with Tsinghua University to improve online 3D semantic and instance segmentation. This can lower the computer power needed for environment identification, enabling its use in automated driving and VR.
OPPO makes AI ‘lightweight’ with second place win in the NAS Challenge
CVPR 2022 also saw a number of technical challenges take place, with OPPO placing third and above in eight challenges. These include the neural architecture search (NAS) challenge, SoccerNet, SoccerNet Replay Grounding, ActivityNet temporal localization, the 4th Large-scale Video Object Segmentation Challenge.
Deep learning methods are used in mobile photography and automated driving. Deep learning relies on huge data, calculation power, and cost, which hinders its practical deployment. NAS approaches find and implement optimum neural network topologies. OPPO researchers trained 45,000 mini neural networks to inherit the supernetwork’s parameters in the NAS competition.
Using the NAS technique, researchers only need to train a large super network and create a predictor to let the subnetworks learn by inheriting the super network parameters. This provides an efficient and low-cost approach to obtaining a deep learning model that outperforms those manually designed by expert network architects. This will ultimately bring previously unthinkable levels of AI technology to mobile devices in the near future.
During CPVR 2022, OPPO also participated in seminar presentations and three high-level workshops. At the SLAM seminar, OPPO researcher Deng Fan shared how real-time vSLAM could be run on smartphones and AR/VR devices. In AICITY Workshop, Li Wei proposed a multi-view based motion localization system to identify abnormal behavior of drivers while driving.
OPPO is bringing the benefits of AI to more people, sooner
This is the third year that OPPO has participated at CVPR. OPPO’s rising success at CVPR during these three years owes much to its continued investment in AI technology. At the beginning of 2020, the Institute of Intelligent Perception and Interaction was established under the OPPO Research Institute to further deepen OPPO’s exploration of cutting-edge AI technologies. Today, OPPO has more than 2,650 global patent applications in the field of AI.
Guided by its brand proposition, ‘Inspiration Ahead’, OPPO is also working with partners across the industry to take AI technology from the laboratory into daily life. OPPO’s AI technology has also been used to develop products and features such as the real-time spatial AR generator CybeReal, OPPO Air Glass, Omoji, and more. Through these technologies, OPPO is aiming to create more lifelike digital worlds that combine virtual and reality to create all-new experiences for users.