dirkf
c94a459a24
[utils] Sanitize look-alike Unicode glyphs in non-ID filename fields when --restrict-filenames
...
Implements https://github.com/ytdl-org/youtube-dl/issues/31216#issuecomment-1236102822 , which has a test.
2022-10-11 12:18:12 +00:00
dirkf
6e2626f092
[JSInterp] Improve separation logic
...
Based on 0468a3b325
2022-10-11 05:58:10 +01:00
dirkf
c282e5f8d7
[ZDF] Overhaul ZDF extractors
...
* pull some yt-dlp changes into ZDFBaseIE._extract_format()
* add test cases from yt-dlp to ZDFIE
* fix crash in ZDFIE._extract_mobile() when object had no `formitaeten`
* improve title extraction in ZDFChannelIE (remove trailing station ident)
* avoid extracting non-video playlist items (fixes #31149 )
2022-10-11 00:05:17 +01:00
Xiyue
82e4eca711
[motherless] Fixed the broken uploader_id in the extractor ( #31243 )
...
* Fixed the broken uploader_id in the extractor.
* Make uploader_id RE looser
* Fix uploader_id in test Motherless_3
* Fix group pagination
* # coding: utf-8
Co-authored-by: Andy Xuming <xuminic@gmail.com>
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-10-10 23:52:48 +01:00
dirkf
1b1442887e
[manyvids] Improve extraction ( #31172 )
...
* extract all formats from page
* extract description, uploader, views, likes
* downrate previews
* fix tests
* use txt_or_none()
2022-10-10 19:26:32 +01:00
dirkf
22127b271c
[NRK] Remove explicit Accept-Encoding header that invites Brotli
...
Fixes #31285
2022-10-10 17:41:40 +00:00
coletdjnz
d35557a75d
[Telegraaf] Use mobile GraphQL API endpoint
...
Workaround for Cloudflare 403
Fixes https://github.com/yt-dlp/yt-dlp/issues/5000
Authored by: coletdjnz
2022-10-04 11:43:08 +01:00
pukkandan
7009bb9f31
[jsinterp] Workaround operator associativity issue
...
* temporary fix for player 5a3b6271 [1]
1. https://github.com/yt-dlp/yt-dlp/issues/4635#issuecomment-1235384480
2022-09-03 00:53:56 +01:00
dirkf
218c423bc0
[cache] Add cache validation by program version, based on yt-dlp
2022-09-01 13:28:30 +01:00
dirkf
55c823634d
[jsinterp] Handle new YT players 113ca41c, c57c113c
...
* add NaN
* allow any white-space character for `after_op`
* align with yt-dlp f26af78a8ac11d9d617ed31ea5282cfaa5bcbcfa (charcodeAt and bitwise overflow)
* allow escaping in regex, fixing player c57c113c
2022-09-01 10:57:12 +01:00
dirkf
4050e10a4c
[options] Document that postprocessing is not forced by --postprocessor-args
...
Resolves #30307
2022-08-29 13:02:17 +01:00
dirkf
ed5c44e7b7
[compat] Replace deficient ChainMap class in Py3.3 and earlier
...
* fix version check
2022-08-26 12:22:01 +01:00
dirkf
0f6422590e
[compat] Replace deficient ChainMap class in Py3.3 and earlier
2022-08-26 10:24:42 +01:00
dirkf
4c6fba3765
[jsinterp] Improve try/catch/finally support
2022-08-26 08:51:17 +01:00
dirkf
d619dd712f
[jsinterp] Fix bug in operator precedence
...
* from 164b03c486
* added tests
2022-08-25 12:16:10 +01:00
dirkf
573b13410e
[YouTube] Improve error check for n-sig processing
2022-08-25 12:14:59 +01:00
dirkf
66e58dccc2
[core] Avoid processing empty format list after removing bad formats
...
* also ensure compat encoding of error strings
2022-08-21 00:45:06 +01:00
dirkf
556862bc91
[utils] Ensure RFC3986 encoding result is unicode
2022-08-21 00:45:06 +01:00
gudata
a8d5316aaf
[infoq] Avoid crash if the page has no mp3Form
...
* proposed fix for issue #31131 , aligns with yt-dlp
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-08-19 21:00:21 +01:00
dirkf
fd3f3bebd0
[uktvplay] Support domain without .uktv
2022-08-19 19:11:08 +01:00
dirkf
46b8ae2f52
[jsinterp] Clean up and pull yt-dlp style
...
* add compat_re_Pattern
* improve compat_collections_chain_map
* use class JS_Undefined
* remove unused code
2022-08-19 15:34:33 +01:00
dirkf
538ec65ba7
[jsinterp] Handle regexp literals and throw/catch execution ( #31182 )
...
* based on f6ca640b12
, thanks pukkandan
* adds parse support for regexp flags
2022-08-19 11:45:04 +01:00
dirkf
b0a60ce203
[jsinterp] Improve JS language support ( #31175 )
...
* operator ??
* operator ?.
* operator **
* accurate operator functions
* `undefined` handling
* object literals {a: 1, "b": expr}
* more tests for weird JS comparisons: see https://github.com/ytdl-org/youtube-dl/issues/31173#issuecomment-1217854397 .
2022-08-17 14:22:02 +01:00
dirkf
e52e8b8111
[postprocessor] Don't replace existing value with null metadata parsed from title
2022-08-15 16:45:04 +01:00
dirkf
d231b56717
[jsinterp] Overhaul JSInterp to handle new YT players 4c3f79c5, 324f67b9 ( #31170 )
...
* back-port from yt-dlp 8f53dc44a0cc1c2d98c35740b9293462c080f5d0, thanks pukkandan
* also support void, improve <</>> precedence, improve expressions in comma-list
* add more tests
2022-08-14 18:45:45 +01:00
dirkf
e6a836d54c
[core] Make --max-downloads ...
stop immediately on reaching the limit
...
Based on and closes #26638 .
2022-08-10 15:37:59 +01:00
dirkf
deee741fb1
[test, etc] Improve download test logs; also clean up some new flake8 issues ( #31153 )
...
* [test] Identify testcase errors better
* [test] Identify download errors better
* [extractor/minds] Linter
* [extractor/aes] Linter
2022-08-09 21:05:00 +01:00
Wes
adb5294177
[aenetworks] Update _THEPLATFORM_KEY and _THEPLATFORM_SECRET ( #29749 )
...
Fixes ytdl-org/youtube-dl#29300
2022-07-30 02:10:00 +01:00
Kyraminol Endyeran
5f5c127ece
[VVVVID] Support video/dash types ( #31060 )
...
Resolves #31030 .
2022-07-12 00:35:40 +01:00
dirkf
090acd58c1
[options] Improve be35e53
(--match-/reject-title parameter value)
...
Resolves #31064 .
2022-07-03 20:05:21 +01:00
dirkf
a03b9775d5
[Mediaset] Support player version number in URL pattern
...
Ref: https://github.com/yt-dlp/yt-dlp/issues/4141
2022-06-26 14:24:06 +01:00
dirkf
8a158a936c
[NHK] Use new API URL
2022-06-15 18:28:19 +01:00
dirkf
cc179df346
[XHamster] Support xhday.com alias, extract uploader_id
...
* support xhday.com alias for xhamster.com (resolves #31023 )
Authored by: dirkf
* extract `uploader_id`:
from 908b56eaf7
(PR https://github.com/yt-dlp/yt-dlp/pull/844 )
Authored by: octotherp
2022-06-12 14:10:38 +01:00
pukkandan
0700fde640
[utils, etc] Kill child processes when yt-dl is killed
...
* derived from PR #26592 , closes #26592
Authored by: Unrud
2022-06-10 19:57:46 +01:00
dirkf
811c480f7b
[YouTube] Support JSON3 subtitle format
...
* subtitle tests updated to match
2022-06-09 15:25:23 +01:00
dirkf
530f4582d0
[HRFernsehen] Back-port new extractor from yt-dlp
...
Closes #26445 , where this was originally proposed.
2022-06-06 19:29:48 +01:00
pukkandan
1baa0f5f66
[utils] Escape URL while sanitizing
...
Closes #31008 , #yt-dlp/263
While this fixes the issue in question, it does not try to address the root-cause of the problem
Refer: 915f911e365736227e134ad654601443dbfd7ccb, f5fa042c82300218a2d07b95dd6b9c0756745db3
2022-06-06 16:03:04 +01:00
dirkf
04fd3289d3
[YouPorn] Improve upload_date
extraction
...
See https://github.com/yt-dlp/yt-dlp/issues/2701#issuecomment-1034341883
2022-05-28 13:54:32 +01:00
dirkf
52c3751df7
[utils] Enable ALPN in HTTPS to satisfy broken servers
...
See https://github.com/yt-dlp/yt-dlp/issues/3878
2022-05-28 13:52:51 +01:00
dirkf
187a48aee2
[YouTube] Handle player c5a4daa1 with indirect n-function definition
...
* resolves #30976
2022-05-24 15:43:56 +01:00
Jacob Chapman
be35e5343a
Update options.py
2022-05-20 05:25:54 +01:00
dirkf
c3deca86ae
[wat.tv] Add version pver
to metadata API call
...
Resolves #30959 .
2022-05-19 17:41:48 +00:00
dirkf
c7965b9fc2
[NHK] Support alphabetic characters in 7-char NhkVod IDs ( #29682 )
2022-05-09 18:54:41 +01:00
dirkf
e27d8d819f
[streamcz] Remove empty '{}'.format()
for Py2.6
...
Use `'-join()'` here, or `{0}`, ..., in general.
2022-04-29 13:36:02 +01:00
Árni Dagur
ebc627847c
[KTH] Add new extractor for KTH play ( #30885 )
...
* Implement extractor for KTH play
* Make KTH Play url regex more relaxed
2022-04-28 10:18:10 +01:00
dirkf
a0068bd6be
[Youtube] Fix "n" descrambling for player fae06c11
...
Resolves #30856 .
2022-04-15 16:07:09 +01:00
nixxo
871645a4a4
[RAI] Fix extraction of http formats
...
From https://github.com/yt-dlp/yt-dlp/pull/3272
Closes https://github.com/yt-dlp/yt-dlp/issues/3270
Authored by: nixxo
2022-04-05 15:21:59 +01:00
nixxo
1f50a07771
[RAI] Extend formats with direct http mp4 link (PR #27990 )
...
* initial support for creating direct mp4 link
* improved regexes and info extraction
* added "connection: close" to request headers
* updated to https://github.com/yt-dlp/yt-dlp/pull/208
2022-04-05 15:21:59 +01:00
nixxo
9e5ca66f16
[RAI] Added checks for DRM protected content (PR #27657 )
...
reviewed by pukkandan (https://github.com/yt-dlp/yt-dlp/pull/150 )
2022-04-05 15:21:59 +01:00
lihan7
17d295a1ec
[extractor/bilibili] Fix path "/audio/auxxxxx" download return 403
2022-04-01 00:46:34 +01:00
dirkf
49c5293014
Ignore --external-downloader-args if --external-downloader was rejected
...
... and generate warning
2022-03-25 14:47:26 +00:00
df
6508688e88
Make default upload_/release_date a compat_str
...
Ensures download tests pass in Python 2 as well as 3; also
add YoutubeDL tests for timestamp -> upload_date etc.
2022-02-26 10:29:42 +00:00
dirkf
4194d253c0
Avoid skipping ID when unlisted_hash is numeric
...
Pattern needed a non-greedy match; also replaced a redundant test with one for this, issue 29690
2022-02-26 10:29:42 +00:00
dirkf
f8e543c906
[Alsace20TV] Add new extractors Alsace20TVIE, Alsace20TVEmbedIE
2022-02-24 18:43:47 +00:00
dirkf
c4d1738316
[CPAC] Add extractor for Canadian Parliament
...
CPACIE: single episode
CPACPlaylistIE: playlists and searches
2022-02-24 18:27:57 +00:00
dirkf
1f13ccfd7f
Fixed groups() call on potentially empty regex search object ( #30676 )
...
* Fixed groups() call on potentially empty regex search object.
- https://github.com/ytdl-org/youtube-dl/issues/30521
* minimising lines changed
Co-authored-by: yayorbitgum <50963144+yayorbitgum@users.noreply.github.com>
2022-02-24 18:26:58 +00:00
marieell
923292ba64
[aliexpress] Fix test case
2022-02-24 13:44:52 +00:00
Lesmiscore (Naoya Ozaki)
782bfd26db
[bigo] add support for bigo.tv ( #30635 )
...
* [bigo] add support for bigo.tv
* [bigo] prepend "Bigo says"
* title fallback
* add error for invalid json data
2022-02-24 13:34:32 +00:00
Vladimir Stavrinov
3472227074
[rutv] fix vbr for empty string value ( #30623 )
...
* [rutv] use str_to_int() (thx dirkf)
2022-02-14 17:54:31 +00:00
Petr Vaněk
bf23bc0489
add missing __future__ import unicode_literals
2022-02-14 07:07:05 +00:00
Petr Vaněk
85bf26c1d0
resolve problem with unpacking operator for <py3.5
2022-02-14 07:07:05 +00:00
Petr Vaněk
d8adca1b66
[streamcz] test fixes and one additional test
2022-02-14 07:07:05 +00:00
Petr Vaněk
d02064218b
do not use f-strings
2022-02-14 07:07:05 +00:00
Petr Vaněk
b1297308fb
avoid traverse_obj function
2022-02-14 07:07:05 +00:00
Petr Vaněk
8088ce036a
revert: use _match_valid_url function
2022-02-14 07:07:05 +00:00
Petr Vaněk
29f7bfc4d7
[streamcz] cherry-pick from yt-dlp
...
Cherry-picked-from: 7d449fff5346 ("[streamcz] Fix extractor (#1616 )")
2022-02-14 07:07:05 +00:00
dirkf
74f8cc48af
[extractor/videa] Back-port from yt-dlp PRs 463+1028
...
Authored by: nyuszika7h
2022-02-11 12:43:26 +00:00
kikuyan
8ff961d10f
[extractor/videa] fix extraction in Py2
...
Fixes #30416
2022-02-11 12:43:26 +00:00
dirkf
266b6ef185
[BBC] Also allow PID with leading 'l' (live?)
2022-02-09 21:21:59 +00:00
dirkf
825d3426c5
[Nuvid] Use site JSON for video details ( #29332 )
...
Back-port yt-dlp PR 1022 onto PR #17890 and update
Video details aren't in the original HTML now but populated by async JS
Co-authored by: u-spec-png
Co-authored by: vidaritos
2022-02-09 02:40:34 +00:00
dirkf
47b0c8697a
[ARD] Back-port subtitle extraction from yt-dlp PR 2409
...
Authored by: fstirlitz
Fixes #30543
Closes #17766 (thanks ngdio)
2022-02-07 13:47:38 +00:00
Seonghyeon Cho
734dfbb4e3
Remove redundant assigning format_id
2022-02-05 03:04:35 +00:00
df
ddc080a562
Add ArteTVCategoryIE to support category playlists
2022-02-05 03:02:56 +00:00
Abdullah Ibn Fulan
16a3fe2ba6
Updated Album URL regex
...
Mistakenly forgot to edit a line in last commit.
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-02-05 02:53:23 +00:00
Abdullah Ibn Fulan
c820a284a2
[extractor/audiomack] Updated URL regex, corrected invalid testcases, fixed bug
...
Co-authored-by: dirkf <fieldhouse@gmx.net>
2022-02-05 02:53:23 +00:00
dirkf
58babe9af7
Support __INITIAL_DATA__ with stringified JSON
...
Add test and fix test for bbcthreeConfig
2022-02-05 02:51:46 +00:00
df
6d4932f023
Try for timestamp, description from window.__INITIAL_DATA__ pages
2022-02-05 02:51:46 +00:00
dirkf
92d73ef393
[niconico] Implement heartbeat for download
2022-02-05 02:47:21 +00:00
dirkf
91278f4b6b
[niconico] Back-port extractor from yt-dlp
...
Add Nico search extractors, fix extraction
2022-02-05 02:47:21 +00:00
dirkf
584715a803
[applepodcasts] Extract default thumbnail image
2022-02-05 02:32:45 +00:00
dirkf
e00b0eab1e
[applepodcasts] Improve format extraction
...
Set acodec and vcodec, etc, to avoid breaking, eg, bestaudio
2022-02-05 02:32:45 +00:00
dirkf
005339d637
[applepodcasts] Support new AMP-ish page structure
2022-02-05 02:32:45 +00:00
Chris Rose
23ad6402a6
xvideos: Fix for #30271
2022-02-05 02:24:51 +00:00
dirkf
9642344965
Fix tests for working IEs; disable obsolete WDRMobile
2022-02-05 02:22:45 +00:00
dirkf
568c7005d5
Fix WDRMaus; extend URL matching for other Maus pages; improve ID extraction
2022-02-05 02:22:45 +00:00
dirkf
5cb4833f40
Update URPlayIE extractor for Next.js page format, with subtitles
2022-02-05 02:16:53 +00:00
dirkf
5197336de6
Support more deeply nested ptmd_path with test, update tests
2022-02-05 02:14:35 +00:00
dirkf
01824d275b
Additional tweaks: allow any .ndr.de, simplify quote match
2022-02-05 02:12:44 +00:00
dirkf
39a98b09a2
Fix NDR, NJoy tests
2022-02-05 02:12:44 +00:00
dirkf
f0a05a55c2
NJoy: improve extraction of NDR id, description, etc with current page formats
2022-02-05 02:12:44 +00:00
dirkf
4186e81777
NDR: improve extraction of NDR id, description, etc with current page formats
2022-02-05 02:12:44 +00:00
dirkf
b494824286
Support Tele5 pages with Discovery Networks format instead of JWPlatform
2022-02-05 02:08:11 +00:00
dirkf
8248133e5e
Back-port yt-dlp Viki extractor
...
From https://github.com/yt-dlp/yt-dlp/pull/2540
2022-02-04 15:49:12 +00:00
dirkf
27dbf6f0ab
Return the item itself if playlist has one entry
...
Removes playlist spam from log
2022-02-04 14:28:50 +00:00
dirkf
61d791726f
Find TV2DK Kaltura ID in Nuxt.js page format
2022-02-04 14:28:50 +00:00
pukkandan
0c0876f790
[youtube:search] Add tests
2022-02-04 11:09:18 +00:00
dirkf
5add3f4373
Merge branch 'pukkandan-yt-searchurl' into yt-dl-master
...
Closes #27749
2022-02-04 03:50:32 +00:00
pukkandan
78ce962f4f
[youtube] Support channel search
...
Code from cd684175ad
2022-02-03 01:02:58 +00:00
dirkf
41f0043983
Avoid crashing if n-sig decode fails
2022-02-02 14:25:03 +00:00
dirkf
34c06b16f5
Support Youtube Shorts URL format
2022-02-01 14:40:20 +00:00