abstract |
An electronic device with processor(s), memory, and a touch screen display presents a media item, where the media item is associated with a metadata structure that includes first information identifying at least a portion of an audio track, second information identifying one or more media files, and third information identifying one more audio and/or video effects. The presenting includes: displaying one or more media files associated with the media item; and playing back at least a portion of an audio track associated with the media item in synchronization with the one or more media files. While presenting the media item, the device: detects a touch input gesture; and, in response to detecting the touch input gesture, applies an audio and/or video effect specified by the third information to the audio track being played back and/or at least a portion of the one or more media files being displayed. |