Publication

Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI

This is the first work that allows multimodal LLMs to elastically switch between input data modalities at runtime, for embodied AI applications such as autonomous navigation. Our basic technical approach is to use fully trainable projectors to adaptively connect the unimodal data encoders being used to a flexible set of last LLM blocks. In this way, we can flexibly adjust the amount of LLM blocks being connected to balance between accuracy of runtime fine-tuning cost, and optimize the efficiency of cross-modal interaction by controlling the amount of information being injected in each connection. Our implementations on NVidia Jetson AGX Orin demonstrate short modality adaptation delays of few minutes with mainstream LLMs, 3.7x fine-tuning FLOPs reduction, and 4% accuracy improvements on multimodal QA tasks.

Kai Huang, Boyuan Yang, Wei Gao

December 2023 In arXiv

Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

The growing need of fine-tuning large language models (LLMs) can lead to significant energy consumption and environmental impact. To address this issue, we introduce GreenTrainer, a novel LLM fine-tuning technique. GreenTrainer assesses the backpropagation costs and contributions of different tensors to model accuracy, allowing for the selection of the most efficient set of tensors. This selection is guided by a user-defined objective, which can adapt to energy supply considerations and Green AI goals. Experimental results demonstrate that GreenTrainer can reduce FLOPs by up to 64% without compromising model accuracy, and outperforms existing techniques like LoRA while maintaining comparable FLOPs reduction.

Kai Huang, Hanyun Yin, Heng Huang, Wei Gao

September 2023 2024 ICLR

Towards Green AI in Fine-tuning Large Language Models via Adaptive Backpropagation

Tackling the Unlimited Staleness in Federated Learning with Intertwined Data and Device Heterogeneities

Federated Learning (FL) efficiency is influenced by intertwined data and device heterogeneities. Traditionally, these factors are treated separately, which becomes ineffective in addressing staleness issue due to asynchronous FL. We introduce a novel FL framework employing the gradient inversion technique to get estimations of clients’ local training data from their uploaded stale model updates, and use these estimations to compute non-stale client model updates, which addresses both data quality and privacy concerns. Experiments on mainstream datasets reveal our approach enhances model accuracy by up to 20% and accelerates FL training by up to 35% over existing methods.

Haoming Wang, Wei Gao

September 2023 In arXiv

Tackling the Unlimited Staleness in Federated Learning with Intertwined Data and Device Heterogeneities

PTEase: Objective Airway Examination for Pulmonary Telemedicine using Commodity Smartphones

The first mobile health system that turns a commodity smartphone into a fully functional pulmonary examination device to measure the internal physiological conditions of human airways, such as airway caliber, obstruction and possible inflammation. Information about these airway conditions could provide vital clues for precise and objective pulmonary disease evaluation.

Xiangyu Yin, Kai Huang, Wei Gao

June 2023 In MobiSys'23

PTEase: Objective Airway Examination for Pulmonary Telemedicine using Commodity Smartphones

ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection

The first on-device AI technique that achieves full elasticity of on-device training on resource-constrained mobile and embedded devices. By leveraging the principle of eXplainable AI (XAI) and evaluating the importance of different tensors in training, we allow fully flexible adaptation of the trainable neural network portion at runtime, according to the current training needs and online data patterns, to minimize the training cost without accuracy loss.

Kai Huang, Boyuan Yang, Wei Gao

June 2023 In MobiSys'23

ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection

Out-Clinic Pulmonary Disease Evaluation via Acoustic Sensing and Multi-Task Learning on Commodity Smartphones

This work prsents AWARE, a new sensing and AI system that supports accurate and reliable pulmonary function tests (PFT) using commodity smartphones. AWARE uses a smartphone to transmit acoustic signals and reconstructs the profile of human airway based on the analysis of reflected acoustic waves captured from the smartphone’s microphone. The subject’s pulmonary condition is then evaluated by a multi-task learning model to detect airway obstruction and inflammation and identify pulmonary diseases like asthma or COPD.

Xiangyu Yin, Kai Huang, Wei Gao

November 2022 In SenSys'22

Out-Clinic Pulmonary Disease Evaluation via Acoustic Sensing and Multi-Task Learning on Commodity Smartphones

AiFi: AI-Enabled WiFi Interference Cancellation with Commodity PHY-Layer Information

This work applies on-device AI techniques to interference cancellation in WiFi networks and enables generalizable interference cancellation on commodity WiFi devices without any extra RF hardware. By using neural network models to mimic WiFi network’s PHY-layer operation, AiFi can be generally applied to different types of interference signals ranging from concurrent WiFi transmissions, ZigBee/Bluetooth to wireless baby monitors or even microwave oven, and improves the MAC-layer frame reception rate by 18x.

Ruirong Chen, Kai Huang, Wei Gao

November 2022 In SenSys'22

AiFi: AI-Enabled WiFi Interference Cancellation with Commodity PHY-Layer Information

Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI

AgileNN is the first work that achieves real-time inference (<20ms) of mainstream neural network models (e.g., ImageNet) on extremely weak MCUs (e.g., STM32 series with <1MB of memory), without impairing the inference accuracy. The usage of eXplainable AI (XAI) techniques allows >6x improvement of feature compressibility during offloading and >8x reduction of the local device’s resource consumption.

Kai Huang, Wei Gao

October 2022 In MobiCom'22

Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI

TransFi: emulating custom wireless physical layer from commodity wifi

TransFi realizes fine-grained signal emulation and allows commodity WiFi devices to emulate custom wireless physical layer, including but not limited to, custom PHY-layer preambles and new ways of agile spectrum usage. It could also improve the performance of cross-technology communication and many other wireless applications by up to 50x, enabling high-speed data communication on par with commodity WiFi.

Ruirong Chen, Wei Gao

June 2022 In MobiSys'22

TransFi: emulating custom wireless physical layer from commodity wifi

RAScatter: Achieving Energy-Efficient Backscatter Readers via AI-Assisted Power Adaptation

A new backscatter system of precise, adaptive and lightweight power adaptation towards energy-efficient backscatter readers. It learns the entangled correlation between backscatter channel conditions, transmit RF power and goodput by designing a modular neural network to avoid redundancy and any confusion in training. Experiment results show improvement in energy efficiency at readers by 3.5× and reduces power consumption by up to 80%.

Kai Huang, Wei Gao

June 2022 In IoTDI'22

RAScatter: Achieving Energy-Efficient Backscatter Readers via AI-Assisted Power Adaptation

FaceListener: Recognizing Human Facial Expressions via Acoustic Sensing on Commodity Headphones

FaceListener transforms the commodity headphone into an acoustic sensing device, which captures the face skin deformations caused by fa-cial muscle movements with different facial expressions. To ensure the recognition accuracy, FaceListener leverages the knowledge distillation technique to learn the subtle correlation between face skin deformation and the acoustic signal changes.

Xingzhe Song, Kai Huang, Wei Gao

May 2022 In IPSN'22

FaceListener: Recognizing Human Facial Expressions via Acoustic Sensing on Commodity Headphones

Eavesdropping user credentials via GPU side channels on smartphones

This is one of the few works that demonstrate critical security vulnerabilities of mainstream GPUs (QualComm Adreno GPU on Snapdragon SoCs) on smartphones, which allow an unprivileged attacker to eavesdrop the user’s sensitive credentials such as app username and password.

Boyuan Yang, Ruirong Chen, Kai Huang, Wei Gao

March 2022 In ASPLOS'22

Eavesdropping user credentials via GPU side channels on smartphones

MyoMonitor: Evaluating Muscle Fatigue with Commodity Smartphones

We present a new technique to mimic an active sonar system with the smartphone’s built-in microphone and speaker, and use this sonar system to evaluate muscle fatigue from the muscle’s surface characteristics that can be measured from the transmitted acoustic signal.

Xingzhe Song, Wei Gao

March 2021 In Smart Health

MyoMonitor: Evaluating Muscle Fatigue with Commodity Smartphones

MagHacker: eavesdropping on stylus pen writing via magnetic sensing from commodity mobile devices

We present MagHacker, a new sensing system that realizes such eavesdropping attack over commodity mobile devices, which monitor and analyze the magnetic field being produced by the stylus pen’s internal magnet. It divides the continuous magnetometer readings into small segments that represent individual letters, and then translates these readings into writing trajectories for letter recognition.

Yihao Liu, Kai Huang, Xingzhe Song, Boyuan Yang, Wei Gao

June 2020 In MobiSys'20

MagHacker: eavesdropping on stylus pen writing via magnetic sensing from commodity mobile devices

SpiroSonic: monitoring human lung function via acoustic sensing on commodity smartphones

The first work that allows commodity smartphones to be used as a portable spirometer and provide accuracy lung function test results on par with clinical-grade spirometers. This is a collaborative work with the Children’s Hospital of Pittsburgh, and could also potentially contribute to in-home evaluation of COVID-19 risks by allowing convenient out-of-clinic lung function evaluation.

Boyuan Yang, Wei Gao

April 2020 In MobiCom'20

SpiroSonic: monitoring human lung function via acoustic sensing on commodity smartphones

StarLego: Enabling Custom Physical-Layer Wireless over Commodity Devices

StarLego is a wireless system that can produce custom wireless signals over commodity devices without hardware modification. It is showcased by implementing a custom WiFi PHY preamble, and exhibits great promise to facilitate penetration of new wireless PHY techniques to existing wireless systems.

Ruirong Chen, Wei Gao

March 2020 In HotMobile'20

StarLego: Enabling Custom Physical-Layer Wireless over Commodity Devices

EasyPass: combating IoT delay with multiple access wireless side channels

EasyPass exploits the excessive SNR margin in a wireless channel as a dedicated side channel for IoT traffic, and allow multiple access to the side channel by separating signals from different transmitters on the air. It allows multiple IoT devices to simultaneously transmit data over a congested wireless link without being delayed.

Haoyang Lu, Ruirong Chen, Wei Gao

December 2019 In CoNEXT'19

EasyPass: combating IoT delay with multiple access wireless side channels

Device-Free Acoustic Motion Tracking over Targets with Large Sizes

Device-free acoustic motion tracking allows a commodity mobile device to precisely track the human user’s motion, without applying any extra hardware tracker on the human body. We present a new system that ensures precise motion tracking over large targets by correctly interpreting the reflected acoustic signal and extracting the phase from the signal.

Yuqi Li, Ruirong Chen, Xingzhe Song, Wei Gao

November 2019 In 2019 MASS

Device-Free Acoustic Motion Tracking over Targets with Large Sizes

Enabling Cross-Technology Coexistence for Extremely Weak Wireless Devices

EmBee is designed to exploit the diversity of different wireless technologies’ spectrum utilization, so as to adaptively reserve occupied spectrum from the strong devices for weak wireless devices’ concurrent data transmissions. It is a new wireless PHY technique that enables cross-technology coexistence at zero cost or performance loss to these extremely weak wireless devices.

Ruirong Chen, Wei Gao

May 2019 In INFOCOM 2019

Enabling Cross-Technology Coexistence for Extremely Weak Wireless Devices

Minimizing Wireless Delay with a High-Throughput Side Channel

We present a novel design of high-throughput wireless side channel, which operates concurrently with the existing wireless network channel over the same spectrum but dedicates to real-time traffic. Our key idea of realizing such a side channel is to exploit the excessive SNR margin in the wireless network to encode data as patterned interference. We design such patterned interference in the form of energy erasure over specific subcarriers in an OFDM-based wireless network, and achieve a data rate of 1.25 Mbps in the side channel without affecting the existing wireless network links.

Ruirong Chen, Haoyang Lu, Wei Gao

April 2019 In IEEE TMC Volume 19 Issue 7

DeltaVR: achieving high-performance mobile VR dynamics through pixel reuse

This work leverages the unique characteristics of image warping used in current VR applications, and fundamentally expand the scope of image warping to the entire VR lifespan to precisely capture the fluctuations of VR scene due to VR dynamics. We implemented our design over Android OS and Unity VR application engine, and demonstrated that our design can maximize the mobile VR performance over highly dynamic VR scenarios with 95% less amount of VR frame data being transmitted.

Yong Li, Wei Gao

April 2019 In IPSN'18

DeltaVR: achieving high-performance mobile VR dynamics through pixel reuse

MUVR: Supporting Multi-User Mobile Virtual Reality with Resource Constrained Edge Cloud

MUVR aims to remove the performance constraint of highly dynamic VR appliations by adaptively reusing the redundant VR frames being rendered for different VR users. The redundancy in each frame is decided at run-time by the edge cloud, which further reuses its redundant pixels compared with other frames. The design implementation over Android OS and Unity VR demonstrated that the design can reduce edge computation burden and transmitted VR frame data.

Yong Li, Wei Gao

October 2018 In SEC 2018

MUVR: Supporting Multi-User Mobile Virtual Reality with Resource Constrained Edge Cloud

Continuous Wireless Link Rates for Internet of Things

This work presents vMod, a lightweight and practical solution towards maximum wireless network throughput by redesigning the wireless link rates from discrete to continuous. The key idea of vMod is to modulate a fractional number of data bits into each symbol by employing the Variable-Length Code (VLC), which is able to statistically yield any link rate.

Haoyang Lu, Wei Gao

April 2018 In IPSN'18