Zenon Mousmoulas
a2fd63ce22
JSON-LD: Tweak (News)Article description extraction
...
Let JSON-LD extract description from articleBody and fall back to
description field when processing (News)Article typed nodes
2021-11-12 09:35:50 +02:00
Zenon Mousmoulas
d6469de1da
Extend TestInfoExtractor.test_search_json_ld_realworld to cover @graph
...
expressing JSON-LD implicit default graph
* Refactor tests in a list of 3-tuples: test html string, expected dict,
keyword args for InfoExtractor._search_json_ld
* Adapt test code accordingly
* Add test for @graph expressing JSON-LD implicit default graph
2021-11-12 09:35:50 +02:00
Zenon Mousmoulas
77e8f5353c
JSON-LD: Support top-level @graph expressing implicit default graph
...
Per W3C JSON-LD v1.1 §4.9 (non-normative ref):
When a JSON-LD document's top-level structure is a map that contains
no other keys than @graph and optionally @context (properties that
are not mapped to an IRI or a keyword are ignored), @graph is
considered to express the otherwise implicit default graph.
Support such a structure in InfoExtractor._json_ld parsing:
Wrap the control flow block in a function, which is called recursively
upon such a structure
2021-11-12 09:30:17 +02:00
bopol
a803582717
[peertube] only call description endpoint if necessary ( #29383 )
2021-07-01 06:53:22 +00:00
Remita Amine
7fb9564420
[periscope] pass referer to HLS requests( closes #29419 )
2021-06-28 20:08:39 +01:00
Aleri Kaisattera
379f52a495
[liveleak] Remove extractor ( closes #17625 , closes #24222 ) ( #29331 )
2021-06-21 04:23:50 +07:00
Sergey M․
cb668eb973
[pornhub] Add support for pornhubthbh7ap3u.onion
2021-06-21 04:08:15 +07:00
Sergey M․
751c9ae39a
[pornhub] Detect geo restriction
2021-06-21 03:33:43 +07:00
Sergey M․
da32828208
[pornhub] Dismiss tbr extracted from download URLs ( closes #28927 )
...
No longer reliable
2021-06-21 03:22:37 +07:00
Sergey M․
2ccee8db74
[curiositystream:collection] Extend _VALID_URL ( closes #26326 , closes #29117 )
2021-06-21 01:54:52 +07:00
Sergey M․
47f2f2fbe9
[youtube] Make get_video_info processing more robust ( closes #29333 )
2021-06-21 01:35:21 +07:00
Sergey M․
03ab02730f
[youtube] Workaround for get_video_info request (refs #29333 )
...
See https://github.com/ytdl-org/youtube-dl/issues/29333#issuecomment-864049544
2021-06-21 01:34:27 +07:00
Tianyi Shi
4c77a2e538
[bilibili] Strip uploader name ( #29202 )
2021-06-21 01:03:21 +07:00
bopol
4131703001
[youtube] Update invidious instance list ( #29281 )
2021-06-21 00:42:09 +07:00
Logan B
cc21aebe90
[umg:de] Update GraphQL API URL ( #29304 )
...
Previous one no longer resolves
Co-authored-by: Sergey M. <dstftw@gmail.com>
2021-06-21 00:41:14 +07:00
Sergey M․
57b9a4b4c6
[nrk] Switch psapi URL to https ( closes #29344 )
...
Catalog calls no longer work via http
2021-06-21 00:36:28 +07:00
kikuyan
3a7ef27cf3
[postprocessor/ffmpeg] Show ffmpeg output on error (refs #22680 ) ( #29336 )
2021-06-20 23:58:19 +07:00
kikuyan
a7f61feab2
[egghead] Add support for app.egghead.io ( closes #28404 ) ( #29303 )
...
Co-authored-by: Sergey M. <dstftw@gmail.com>
2021-06-17 10:34:33 +07:00
kikuyan
8fe5d54eb7
[appleconnect] Fix extraction ( #29208 )
2021-06-17 04:12:13 +07:00
kikuyan
d156bc8d59
[orf:tvthek] Add support for MPD formats ( closes #28672 ) ( #29236 )
2021-06-17 04:02:06 +07:00
Sergey M
c2350cac24
[README.md] Update MSVC 2010 redist URL ( closes #29222 )
2021-06-06 05:32:27 +07:00
Sergey M․
b224cf39d5
release 2021.06.06
2021-06-06 01:38:22 +07:00
Sergey M․
5f85eb820c
[ChangeLog] Actualize
...
[ci skip]
2021-06-06 01:32:15 +07:00
Sergey M․
bb7ac1ed66
[facebook] Improve login required detection
2021-06-06 01:16:43 +07:00
Sergey M․
fdf91c52a8
[youporn] Fix formats and view count extraction ( closes #29216 )
2021-06-06 00:11:09 +07:00
Sergey M․
943070af4a
[orf:tvthek] Fix thumbnails extraction ( closes #29217 )
2021-06-05 23:42:25 +07:00
Remita Amine
82f3993ba3
[formula1] fix extraction( closes #29206 )
2021-06-04 17:51:44 +01:00
Sergey M․
d495292852
[ard] Relax _VALID_URL and fix video ids ( closes #22724 , closes #29091 )
2021-05-30 06:14:59 +07:00
Sergey M․
2ee6c7f110
[ustream] Detect https embeds ( closes #29133 )
2021-05-30 03:43:59 +07:00
Sergey M․
6511b8e8d7
[ted] Prefer own formats over external sources ( closes #29142 )
2021-05-30 03:05:22 +07:00
Sergey M․
f3cd1d9cec
[twitch:clips] Improve extraction ( closes #29149 )
2021-05-30 01:49:51 +07:00
phlip
e13a01061d
[twitch:clips] Add access token query to download URLs ( closes #29136 )
2021-05-30 01:47:33 +07:00
Sergey M․
24297a42ef
[youtube] Fix get_video_info request ( closes #29086 , closes #29165 )
2021-05-30 00:36:26 +07:00
Remita Amine
1980ff4550
[vimeo] fix vimeo pro embed extraction( closes #29126 )
2021-05-26 11:04:39 +01:00
Remita Amine
dfbbe2902f
[redbulltv] fix embed data extraction( closes #28770 )
2021-05-17 12:56:49 +01:00
Remita Amine
e1a9d0ef78
[shahid] relax _VALID_URL(closes #28772 , closes #28930 )
2021-05-17 12:37:39 +01:00
Sergey M․
f47627a1c9
release 2021.05.16
2021-05-16 22:55:05 +07:00
Sergey M․
efeb9e0fbf
[ChangeLog] Actualize
...
[ci skip]
2021-05-16 22:40:39 +07:00
Sergey M․
e90a890f01
[playstuff] Add extractor ( closes #28901 , closes #28931 )
2021-05-16 22:31:37 +07:00
Sergey M․
199c645bee
[eroprofile] Skip test
2021-05-16 22:01:51 +07:00
Sergey M․
503a3744ad
[eroprofile] Fix extraction ( closes #23200 , closes #23626 , closes #29008 )
2021-05-16 21:57:21 +07:00
kr4ssi
ef03721f47
[vivo] Add support for vivo.st ( #29009 )
...
Co-authored-by: Sergey M. <dstftw@gmail.com>
2021-05-16 21:46:32 +07:00
Sergey M․
1e8aaa1d15
[generic] Add support for og:audio ( closes #28311 , closes #29015 )
2021-05-16 21:42:38 +07:00
Sergey M․
6423d7054e
[options] Fix thumbnail option group name ( closes #29042 )
2021-05-16 21:34:10 +07:00
Sergey M․
eb5080286a
[phoenix] Fix extraction ( closes #29057 )
2021-05-16 21:21:14 +07:00
Sergey M․
286e01ce30
[generic] Add support for sibnet embeds
2021-05-16 20:50:32 +07:00
Sergey M․
8536dcafd8
[vk] Add support for sibnet embeds ( closes #9500 )
2021-05-16 20:48:24 +07:00
Sergey M․
552b139911
[generic] Add Referer header for direct videojs download URLs ( closes #2879 , closes #20217 , closes #29053 )
2021-05-16 20:29:35 +07:00
Lukas Anzinger
2202cef0e4
[orf:radio] Switch download URLs to HTTPS ( closes #29012 ) ( #29046 )
2021-05-16 19:54:15 +07:00
Sergey M․
a726009987
[blinkx] Remove extractor ( closes #28941 )
...
No longer exists.
2021-05-05 04:12:35 +07:00