media

mirror of https://github.com/androidx/media.git synced 2025-04-30 06:46:50 +08:00

Author	SHA1	Message	Date
Copybara-Service	3d87578779	Merge pull request #1031 from garethfenn:hlschunkseek PiperOrigin-RevId: 603008793 (cherry picked from commit 45bd5c6f0a24b24282bdb971a3a780a9923eff01)	2024-02-08 17:30:04 +00:00
sheenachhabra	9581af110a	Write "stco" box instead of "co64" for fragmented MP4 As per MP4 spec ISO 14496-12: 8.7.5 Chunk Offset Box, Both "stco" and "co64" can be used to store chunk offsets. While "stco" supports 32-bit offsets, "co64" supports 64-bit offsets. In non fragmented MP4, the mdat box can be extremely large, hence muxer uses "co64" box. But for fragmented MP4, muxer does not write any data in this chunk offset box (present in "moov" box) because all sample related info is present in "moof" box. Technically, "co64" box should also work in fragmented MP4because its empty only but QuickTime player fails to play video if "co64" box is present in fragmented MP4 output file. Testing: Verified that QuickTime player does not play video when "co64" box is present but is able to play when "stco" box is present. #minor-release PiperOrigin-RevId: 601147046 (cherry picked from commit 0acf6902e57678dc9e6ea8de3a4b3b88b9ec0519)	2024-01-25 11:42:44 +00:00
rohks	cfb676adaf	Add `Mp4ExtractorTest` for sample with edit list (edts box) The sample has multiple sync frames for video track. PiperOrigin-RevId: 601129350 (cherry picked from commit f8dbbc82e2a36c4207b8408bd30d1b37dfd80eca)	2024-01-25 11:42:44 +00:00
sheenachhabra	539a8f9a24	Write sample data offset related fields in tfhd and trun box This fix makes output playable on VLC player. The output does not play on QuickTime player which is being fixed in a separate CL. #minor-release PiperOrigin-RevId: 601118813 (cherry picked from commit 806f90922b0d327f1f85e677ac2d9555415d60a4)	2024-01-25 11:42:44 +00:00
michaelkatz	db0262efdb	Keep stream offset alive in ImageRenderer until stream transition Fix modeled after OutputStreamInfo usage for stream offset in `MediaCodecRenderer` PiperOrigin-RevId: 601109900 (cherry picked from commit 688622eb47ac707affa824d3d68f44755f947380)	2024-01-25 11:42:44 +00:00
Copybara-Service	c403df116b	Merge pull request #973 from jan-varecka-signageos-io:feat/extractTsAudioType PiperOrigin-RevId: 599547201	2024-01-18 10:25:47 -08:00
sheenachhabra	666685bd85	Use 4 Byte size field in fMP4 mdat boxes Since mdat box can be huge so there is a provision to use 64 bit size field. In case of fragmented MP4, individual fragments should not have large mdat box so a 32 bit size field should be sufficient. PiperOrigin-RevId: 599219041	2024-01-17 10:36:47 -08:00
Jan Vařečka	286b1f52d9	Add test for MPEG audio with audio type	2024-01-17 10:22:55 +00:00
rahulnmohan	27c021fd7b	Merge Issue: androidx/media#275 : MPEG2-TS: Support DTS, DTS-LBR and DTS:X Profile2 Imported from GitHub PR https://github.com/androidx/media/pull/275 Added below mentioned features. - Support for extracting DTS LBR(DTS Express) and DTS UHD Profile 2(DTS:X) descriptor ID from PSI PMT - The DTSReader class is updated for extracting a DTS LBR. - Newly added DtsUhdReader class for extracting DTS UHD frame. - The DTSUtil class is updated to parse the DTS LBR or DTS UHD frame and report the format information. Feature request for ExoPlayer: https://github.com/google/ExoPlayer/issues/11075 Merge 21efa0810db31550d6b215639f9ca2af6a32139a into 104cfc322c095b40f88e705eb4a6c2f029bacdd6 COPYBARA_INTEGRATE_REVIEW=https://github.com/androidx/media/pull/275 from rahulnmohan:dts-mpeg2ts-update 21efa0810db31550d6b215639f9ca2af6a32139a PiperOrigin-RevId: 598854998	2024-01-16 08:54:43 -08:00
tofunmi	0155ae998b	Gaussian Blur: support blurring without drawing sharp image on top PiperOrigin-RevId: 598626481	2024-01-15 09:39:38 -08:00
jbibik	e51c293f75	Plumb SubtitleParser.Factory into MatroskaExtractor MatroskaExtractor will no longer be wrapped in SubtitleTranscodingExtractor, but instead use SubtitleTranscodingExtractorOutput under the hood. FLAG_EMIT_RAW_SUBTITLE_DATA flag will be used to toggle between subtitle parsing during extraction (before the sample queue) or during decoding (after the sample queue). The new extractor dump files generated by `MatroskaExtractorTest` now follow the new parsing logic and hence have mimeType as `x-media3-cues`. PiperOrigin-RevId: 598616231	2024-01-15 08:32:22 -08:00
ibaker	4061d476a1	MP3: Assume an Info header indicates CBR for seeking purposes The seek table in a Xing/Info header is very imprecise (max resolution of 255 to describe each of 100 byte positions in the file). Seeking using a constant bitrate assumption is more accurate, especially for longer files (which exacerbates the imprecision of the Info header). VBR files should contain an Xing header, while an Info header is identical but indicates the file is CBR. Issue: androidx/media#878 PiperOrigin-RevId: 597827891	2024-01-12 06:48:00 -08:00
tofunmi	78c419e566	Extension to Gaussian Blur: support changing blur over time PiperOrigin-RevId: 597809380	2024-01-12 04:53:02 -08:00
jbibik	59afb4fb01	Plumb SubtitleParser.Factory into WebvttExtractor WebvttExtractor will no longer be wrapped in SubtitleTranscodingExtractor, but instead use SubtitleTranscodingExtractorOutput under the hood. A new constructor will take a boolean parameter to toggle between subtitle parsing during extraction (before the sample queue) or during decoding (after the sample queue). PiperOrigin-RevId: 597604942	2024-01-11 11:13:08 -08:00
lpribanic	e93188fe7f	Crop and pass thumbnails to ImageOutput Image grids are now cropped into tiles. The tiles are provided to ImageOutput at their correct timestamps. PiperOrigin-RevId: 597553029	2024-01-11 07:45:50 -08:00
ibaker	1c1d4d506d	MP3: Correct duration calculation in `XingSeeker` An audio file can only play sound between two PCM samples (the 'start' and 'end' of section of a wave form). Therefore when calculating duration from a count of PCM samples we need to subtract one first (the 'end' sample which has no duration of its own). This only changes durations by one PCM sample (21us - 22us for 44.1kHz sample rate). PiperOrigin-RevId: 596990306	2024-01-09 11:16:49 -08:00
tonihei	ba8f55694a	Set PCM encoding in Flac extractor The extractor knows the PCM encoding of the losslessly encoded data in the samples and should set it in the Format to allow downstream components to use this information. PiperOrigin-RevId: 596974863	2024-01-09 10:28:55 -08:00
ibaker	320dd32be9	Add extractor test for an MP3 file with `Info` header Issue: androidx/media#878 PiperOrigin-RevId: 596957503	2024-01-09 09:30:21 -08:00
christosts	4198f7e242	Add utility testing Player.Listener in transformer tests PiperOrigin-RevId: 596891721	2024-01-09 04:36:45 -08:00
ibaker	62f6c64a91	Rename `test.mp3` test asset to `test-cbr-info-header.mp3` This file is CBR encoded with LAME, so it has an `Info` header (the CBR equivalent to `Xing`). A follow-up change will use this file in `Mp3ExtractorTest`. Issue: androidx/media#878 PiperOrigin-RevId: 595938327	2024-01-05 02:31:20 -08:00
sheenachhabra	e0257f403f	Implement fragmented MP4 (fMP4) in the Mp4Muxer Changes includes; 1. Public API to enable fMP4 and to pass fragment duration. 2. Added `FragmentedMp4Writer`. 3. Added logic to create fragments based on given fragment duration. 4. Write "moov" box only once in the beginning. 3. Add all the required boxes for current implementation. 4. Unit tests for all the new boxes. 5. E2E test for generating fMP4. Note: The output file is un seek-able with this first implementation. PiperOrigin-RevId: 594426486	2023-12-29 03:39:38 -08:00
sheenachhabra	1609928242	Throw exception when B-frames are written to Mp4Muxer Mp4Muxer does not support out of order B-frames. Currently it silently writes out of order B-frames, producing an invalid file (with negative sample durations). Although `Mp4Extractor` is somehow able to process this invalid file and `Exoplayer` is able to play it but that is unexpected. The `sample.mp4` test file contains B frames. Other test files does not contain `H264 video + AAC audio` format hence created a new test file by running `sample.mp4` via `Transformer` after applying some effects. PiperOrigin-RevId: 594016144	2023-12-27 08:32:36 -08:00
tofunmi	0ab7bafa87	Migrate Gaussian Blur Effect to media3. PiperOrigin-RevId: 593164068	2023-12-22 14:13:19 -08:00
tofunmi	1632f37d70	Transformer: Add api to drop audio samples before the first video frame fix for Issue: androidx/media#829 Manual Testing: Viewed the transformer output file of previously problematic case in Exoplayer, Chrome, VLC, Quicktime and Safari and all showed the issue not to occur anymore. The newly produced output file can be found at https://github.com/androidx/media/assets/42352357/fdf105c1-9550-422f-b088-7900f655ac78 PiperOrigin-RevId: 593104752	2023-12-22 06:18:07 -08:00
tofunmi	57d00fcca0	Rename crow_emulator_transformer_output PiperOrigin-RevId: 592873355	2023-12-21 08:56:48 -08:00
ibaker	770ca66fbc	Fix HLS CEA-608 when parsing during extraction HLS distinguishes between 'subtitles' (WebVTT or TTML distributed in separate files with their own playlist) and 'captions' (CEA-608 or 708, distributed muxed into the video file). The format transformation added in `7b762642db` only applies to subtitles and not captions. This change makes the same transformation for caption formats. This resolves an error like: ``` SampleQueueMappingException: Unable to bind a sample queue to TrackGroup with MIME type application/cea-608. ``` Also add two playback tests for HLS CEA-608, one that parses during decoding (old way) and one during extraction (new way). Adding these tests is what alerted me to this issue. PiperOrigin-RevId: 592571284	2023-12-20 08:52:39 -08:00
tonihei	59f01388a6	Use platform constants for PCM 24/32 bit encoding These constants are used when setting up the AudioTrack and should match the definition in the platform. PiperOrigin-RevId: 592564644	2023-12-20 08:16:35 -08:00
ibaker	8b9bf5e7cd	Add DASH CEA-608 playback test This was generated by combining the existing `ts/bbb_2500ms.ts` test asset and a temporary `.srt` file using https://cloud.google.com/transcoder/docs/how-to/captions-and-subtitles This doesn't directly reproduce the problem fixed by `7ca26f898d`, because the CEA-608 subs are structured differently to the stream I discovered the problem with (from Issue: androidx/media#887). However this test does fail if that fix is reverted after `486230fbd7`. I'm also not able to repro the character duplication reported in Issue: androidx/media#887 by just changing the manifest in this CL. I'm not yet sure on the exact differences between the stream provided on GitHub and this stream. This stream does provide some regression protection, because it currently fails with 'new' subtitle parsing (`DashMediaSource.Factory.experimentalParseSubtitlesDuringExtraction(true)`), though I'm not sure on the exact reason for that yet. PiperOrigin-RevId: 592476328	2023-12-20 01:13:21 -08:00
claincly	7e65cce967	Fix first frame not force rendered This is because currently 1. Player sets a surfaceView to render to 2. Player intializes the renderer 3. MCVR initializes the VideoSinkProvider, by extension VideoGraph But when 1 happens, MCVR doesn't set the surfaceView on the VideoGraph because it's not initialized. Consequently after VideoGraph is initialized, it doesn't have a surface to render to, and thus dropping the first a few frames. Also adds a test for first frame to verify the correct first frame is rendered. PiperOrigin-RevId: 591228174	2023-12-15 06:01:12 -08:00
Copybara-Service	f465efeefd	Merge pull request #864 from v-novaltd:dsparano-exo129_2 PiperOrigin-RevId: 590234505	2023-12-12 09:37:41 -08:00
ibaker	5580b78b13	Fix `VorbisComment.populateMediaMetadata` key comparison to ignore case Issue: androidx/media#876 #minor-release PiperOrigin-RevId: 590215918	2023-12-12 08:43:02 -08:00
ibaker	d356d88c4f	Improve test dump output for role and selection flags PiperOrigin-RevId: 589878576	2023-12-11 11:05:40 -08:00
ibaker	5266c71b3a	Fix handling of multiple HEVC tracks in JPEG motion photos The MP4 data in JPEG motion photos can contain multiple `video/hevc` tracks, but only the first is at a playable frame rate while the others are low-fps, high-res tracks designed for specific use-cases (not direct video playback). ExoPlayer currently selects the unplayable track by default, because it has a higher resolution. This change introduces a flag to `Mp4Extractor` that results in the first video track being marked as `ROLE_FLAG_MAIN`, and all subsequent video tracks `ROLE_FLAG_ALTERNATE` - this then results in the playable lower-res track being selected by default. PiperOrigin-RevId: 589832072	2023-12-11 08:31:13 -08:00
Daniele Sparano	4a17ca7600	Add basic h264 sample unit test to ensure correct sample flags, including key frames	2023-12-11 14:28:11 +00:00
tofunmi	cd346ca14d	Transformer: Add support for transmuxing audio in trim optimization PiperOrigin-RevId: 588711597	2023-12-07 02:18:21 -08:00
ibaker	6360082b87	Add extractor and playback tests for Pixel JPEG motion photo This image has two video tracks in the MP4 data, one is a 'real' video which we want to play by default, and the other is a low-fps video track which isn't intended to be directly played, it's encoded in HEVC for compression and decoding efficiency. This test demonstrates ExoPlayer's current behaviour default extraction and playback, which results in selecting the high-res, low-fps track (actually single sample in this example), instead of playing the actual video. PiperOrigin-RevId: 588068908	2023-12-05 07:46:19 -08:00
ibaker	d4fe3fe318	Add Robolectric e2e test support for HEVC content PiperOrigin-RevId: 588055594	2023-12-05 06:52:39 -08:00
michaelkatz	d1e38abf93	Limit processing Opus decode-only frames by seek-preroll in offload As Opus decoders skip some bytes prior to playback during a seek, the renderer for bypass playback should send samples to the decoder even if they would be decode-only. However, the renderer should not send samples with time preceding that range. This change adds that constraint. #minor-release PiperOrigin-RevId: 588014983	2023-12-05 03:44:37 -08:00
tonihei	b1541b096f	Map VORBIS channel layout to Android layout Both the extension OPUS decoder and the OMX/C2 MediaCodec implementations for OPUS and VORBIS decode into the channel layout defined by VORBIS. See https://www.xiph.org/vorbis/doc/Vorbis_I_spec.html#x1-140001.2.3 While this is technically correct for a stand-alone OPUS or VORBIS decoder, it doesn't match the channel layout expected by Android. See https://developer.android.com/reference/android/media/AudioFormat#channelMask The fix is to apply the channel mapping after decoding if needed. Also add e2e tests with audio dumps for the extension renderer, including a new 5.1 channel test file. Issue: google/ExoPlayer#8396 #minor-release PiperOrigin-RevId: 588004832	2023-12-05 03:01:32 -08:00
huangdarwin	a2a4ef5f40	Effect: Fix overlay alpha handling. Before, a translucent overlay over an opaque video would result in a translucent output. This is not consistent with physical properties of light (if putting a translucent object in front of an opaque object, you can't see behind the opaque object). Using the mixing properties from DefaultVideoCompositor. PiperOrigin-RevId: 586636275	2023-11-30 04:43:21 -08:00
ibaker	28c210686f	Don't include null text or bitmaps in `Cue.toBundle()` `fromBundle` doesn't distinguish between `FIELD_BITMAP` and `FIELD_TEXT` being present with a null value, or being absent, so we might as well avoid including them when the value is null. I've separated this from a later change to add `Cue.toSerializableBundle` which will also skip setting a bitmap value into the `Bundle` if `this.bitmap == null`. This is partly because it results in changes to a lot of extractor test dump files, and it's easier to review that as a separate change. PiperOrigin-RevId: 586626141	2023-11-30 03:58:33 -08:00
samrobinson	aef2fcb900	Plumb EditedMediaItem for audio preview. PiperOrigin-RevId: 584865962	2023-11-23 05:13:32 -08:00
tofunmi	2d77e4d22c	Implement trim optimization in Transformer PiperOrigin-RevId: 584622392	2023-11-22 07:28:34 -08:00
christosts	ec08db458e	MidiExtractor: mark only the first sample as key-frame This change fixes a bug with seeking forward in MIDI. When seeking forward, the progressive media period attempts to seek within the sample queue, if a key-frame exists before the seeking position. With MIDI, however, we can only skip Note-On and Note-Off samples and all other samples must be sent to the MIDI decoder. When seeking outside the sample queue, the MidiExtractor already instructs the player to start from the beginning of the MIDI input. With this change, only the first output sample is a key-frame, thus the progressive media period can no longer seek within the sample queue and is forced to seek from the MIDI input start always. Issue: androidx/media#704 #minor-release PiperOrigin-RevId: 584321443	2023-11-21 07:39:46 -08:00
samrobinson	79fd3365fa	Use VolumeScalingAudioProcessor in audio effect preview test. Dump files changes are easier to understand with a processor that keeps buffer sizes the same. PiperOrigin-RevId: 584320606	2023-11-21 07:35:01 -08:00
Copybara-Service	fde142d66e	Merge pull request #793 from androidx:mpegh_extractor_changes PiperOrigin-RevId: 584291984	2023-11-21 05:07:27 -08:00
Rohit Singh	4c3ca9b6f4	update mhm1 mp4 samples and corresponding extractor dumps	2023-11-21 12:12:23 +00:00
andrewlewis	7b9aa87344	Allow allocating more buffers when transmuxing When transmuxing, the `EncodedSampleExporter` maintains a queue of input buffers that get filled with encoded data by the asset loader. The number of buffers was limited to avoid using more and more memory if producer (asset loader) gets far ahead of the consumer (exporter). Previously this limit was fixed at 10 buffers, but increasing the number of buffers can make some transmux operations much faster. Allow allocating between a min and max number of buffers, and also set a target allocation size beyond which new buffers can't be allocated. This allows audio formats which require many small buffers to be processed more quickly, while preventing allocating too much memory for hypothetical very high bitrate formats. 'Remove video' edits on local videos in particular get much faster, because audio buffers are very short and there are lots of them. With a sample 10 minute video, a 'remove video' edit took 2 seconds (36 seconds before this change). With a sample 1 minute removing video took 0.25 seconds after this change (2.5 seconds before). The speed improvement is smaller for other types of edits that retain the video track. Transmuxing a 10 minute video retaining the video track took 26 seconds (40 seconds before). PiperOrigin-RevId: 583390284	2023-11-17 08:11:25 -08:00
samrobinson	d5fbf0007b	Migrate to Util.durationUsToSampleCount in transformer audio. PiperOrigin-RevId: 582700443	2023-11-15 09:15:10 -08:00
lpribanic	64d2210b79	Add DashPlayback test for loading thumbnail grids This test instantiates an image renderer, selects an image track and plays a thumbnail grid. PiperOrigin-RevId: 582696009	2023-11-15 09:03:19 -08:00

1 2 3 4 5 ...

334 Commits