1. Detect soft speech sound in the environment 2. Set the sensitivity with calibration coefficient 3. Playback controlling