Introduction

  • Objective

Develop a voice-controlled smart speaker on the Milk-V Duo 256MB. Using the board's hardware resources together with peripherals such as cameras and external screens, the speaker should support basic playback functions through voice commands: play, pause, next track, and volume adjustment.

 

  • Detailed Task Description

1. Voice Command Recognition System Development:
a) Developers need to research and implement a voice command recognition algorithm suitable for the Milk-V Duo that accurately recognizes the following commands (the specific commands and the wake word can be adjusted as needed):

  • Wake word (self-defined)
  • Play/Pause
  • Next Track
  • Increase Volume
  • Decrease Volume

b) The algorithm should work reliably under different environmental conditions, including background noise.
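
The wake-word gating described above can be sketched as follows. This is a minimal illustration, not a prescribed design: it assumes some speech recognizer (not shown) already produces text transcripts, and the wake word "hello duo", the phrase table, the 5-second window, and the names Command and CommandGate are all hypothetical.

```python
from enum import Enum
from typing import Optional


class Command(Enum):
    PLAY_PAUSE = "play_pause"
    NEXT_TRACK = "next_track"
    VOLUME_UP = "volume_up"
    VOLUME_DOWN = "volume_down"


# Hypothetical wake word and phrases; the task lets developers define their own.
WAKE_WORD = "hello duo"
PHRASES = {
    "play": Command.PLAY_PAUSE,
    "pause": Command.PLAY_PAUSE,
    "next track": Command.NEXT_TRACK,
    "increase volume": Command.VOLUME_UP,
    "decrease volume": Command.VOLUME_DOWN,
}


class CommandGate:
    """Accepts a command only if the wake word was heard within the window."""

    def __init__(self, window_s: float = 5.0) -> None:
        self.window_s = window_s
        self._armed_at: Optional[float] = None

    def feed(self, transcript: str, now: float) -> Optional[Command]:
        # Normalize case and whitespace before matching.
        text = " ".join(transcript.lower().split())
        if text == WAKE_WORD:
            self._armed_at = now
            return None
        armed = (
            self._armed_at is not None
            and now - self._armed_at <= self.window_s
        )
        if not armed:
            return None
        command = PHRASES.get(text)
        if command is not None:
            self._armed_at = None  # one command per wake word
        return command
```

Passing the timestamp in explicitly (rather than reading a clock inside feed) keeps the gate easy to test with recorded transcripts.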

 

2. Smart Speaker Function Integration:

  • Implement a music playback system on the Milk-V Duo, supporting common audio formats.
  • Implement the basic control functions of the speaker via voice commands, including play/pause, next track, and volume adjustment.
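
As a rough sketch of the control functions listed above, the playback state can be modeled separately from the audio backend. The PlayerState class, its field names, the 10% volume step, and the wrap-around behavior on the last track are assumptions for illustration; actual audio decoding and output on the Milk-V Duo are not shown.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PlayerState:
    playlist: List[str]
    index: int = 0
    playing: bool = False
    volume: int = 50       # percent, 0..100
    volume_step: int = 10  # hypothetical step per volume command

    def toggle_play(self) -> None:
        # A single command covers both play and pause.
        self.playing = not self.playing

    def next_track(self) -> None:
        # Wrap around to the first track after the last one.
        if self.playlist:
            self.index = (self.index + 1) % len(self.playlist)

    def change_volume(self, direction: int) -> None:
        # direction is +1 (increase) or -1 (decrease); clamp to 0..100.
        new_volume = self.volume + direction * self.volume_step
        self.volume = max(0, min(100, new_volume))

    @property
    def current(self) -> str:
        return self.playlist[self.index]
```

Keeping the state in one small object means the recognizer only has to call one of three methods, and the same object can drive on-screen prompts.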

 

  • Performance Requirements
  1. The voice command recognition algorithm should achieve an accuracy of at least 95%.
  2. The speaker control functions should respond quickly: the total delay from issuing a command to executing the operation should be no more than 1 second.
  3. After running continuously for 1 hour, the system should maintain stable recognition accuracy, with CPU usage not exceeding 70% and memory usage not exceeding 200 MB.
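
One way to check a test run against these thresholds is sketched below. It assumes the measurements have already been collected: how latency, CPU, and memory are sampled on the device is left open, and TrialResult and evaluate are hypothetical names. All three input lists must be non-empty.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TrialResult:
    expected: str      # command that was spoken
    recognized: str    # command the system reported
    latency_s: float   # time from command to executed action


# Thresholds taken from the performance requirements.
MIN_ACCURACY = 0.95
MAX_LATENCY_S = 1.0
MAX_CPU_PERCENT = 70.0
MAX_MEMORY_MB = 200.0


def evaluate(
    trials: List[TrialResult],
    cpu_samples: List[float],
    memory_samples_mb: List[float],
) -> Dict[str, bool]:
    """Return one pass/fail flag per requirement."""
    correct = sum(1 for t in trials if t.recognized == t.expected)
    accuracy = correct / len(trials)
    return {
        "accuracy": accuracy >= MIN_ACCURACY,
        # The requirement is a hard bound, so check the worst case.
        "latency": max(t.latency_s for t in trials) <= MAX_LATENCY_S,
        "cpu": max(cpu_samples) <= MAX_CPU_PERCENT,
        "memory": max(memory_samples_mb) <= MAX_MEMORY_MB,
    }
```

The sketch checks peak values rather than averages, which is the stricter reading of "not exceeding"; an average-based check would be a different design choice.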

 

  • Acceptance Criteria
  1. The voice command recognition system runs stably on the Milk-V Duo, accurately recognizing and responding to the five types of voice commands listed above.
  2. The smart speaker's functionality is complete, including play/pause, next track, and volume adjustment controlled by voice commands.
  3. The user interface is clear and includes clear command prompts, and operation is smooth with no significant bugs.
  4. Functional Testing: The system should pass at least 200 voice command tests, ensuring stable operation under different environmental conditions.
  5. The submitted project should include complete source code, documentation, and necessary resource files for subsequent maintenance and optimization.

 

By completing this task, developers will be able to demonstrate the application potential of the Milk-V Duo in smart interaction, IoT device control, and multimedia integration, while providing users with an innovative and intuitive way to control their devices.