-
fe6c0c74a7
Merge branch 'bugfix/2-fail-on-unavailable-resource' of snegov/nevernote into master
master
snegov
2019-11-09 14:40:52 +00:00
-
6f917578aa
Fix failing on unavailable page resource
Maks Snegov
2019-11-09 17:33:58 +03:00
-
e6db3f9d1b
Fix newlines inside div tag
Maks Snegov
2019-10-22 16:45:40 +03:00
-
3b6df3417a
Fix link tag with missing rel attribute
Maks Snegov
2019-10-22 16:45:13 +03:00
-
e843abbc41
Fix python env string
Maks Snegov
2019-10-22 16:44:27 +03:00
-
89a8dd90cc
Use BS4 for HTML parsing
Maks Snegov
2019-10-22 16:05:29 +03:00
-
3198361266
Add --skip-dups option
Maks Snegov
2019-10-22 14:39:36 +03:00
-
bdceede4f2
Rework fetching URLs from the file
Maks Snegov
2019-10-22 12:15:31 +03:00
-
91cddfab7c
Refactor code
Maks Snegov
2019-10-22 11:43:54 +03:00
-
44b8a17841
Use requests library
Maks Snegov
2019-10-22 11:33:06 +03:00
-
56a7032b3e
Merge branch 'fix_htmlparser_strict'
Maks Snegov
2016-03-10 19:21:48 +03:00
-
26e7176222
strict argument in html.parser.HTMLParser was removed in Python 3.5
Maks Snegov
2016-03-10 19:15:03 +03:00
-
edd12deb37
Merge branch 'devel'
Maks Snegov
2016-02-04 09:10:56 +03:00
-
1a6a7b3c9b
Merge branch 'b64script' into devel
Maks Snegov
2014-10-04 11:08:41 -04:00
-
23f648e1ad
limit filename length to 128 chars plus extension
Maks Snegov
2014-10-04 10:59:32 -04:00
-
c1724b5921
use base64 encoding for embedded scripts
Maks Snegov
2014-10-04 03:38:34 +04:00
-
6b3aa602ef
add script embedding
Maks Snegov
2014-10-04 03:24:38 +04:00
-
cf626546e7
use set of content-types for checking
Maks Snegov
2014-07-23 08:45:12 +04:00
-
fbf52e9544
add script parsing
Maks Snegov
2014-07-21 00:46:30 +04:00
-
7ce2bfb97f
fix urllib.error.HTTPError print
Maks Snegov
2014-07-20 21:42:13 +04:00
-
41e984e1f0
fix urllib.error.HTTPError calls
Maks Snegov
2014-07-20 21:40:14 +04:00
-
fb3870e9dd
skip http error pages
Maks Snegov
2014-07-20 17:31:43 +04:00
-
09346f4a70
fix: error with css charsets if no base charset
Maks Snegov
2014-07-20 17:31:15 +04:00
-
61d3d84a9c
remove unused exception
Maks Snegov
2014-07-20 17:30:48 +04:00
-
b5ddae0ef8
fix css charset error, add urllib.error.HTTPError
Maks Snegov
2014-07-20 17:04:56 +04:00
-
964e79f97b
add gzip encoding support
Maks Snegov
2014-07-20 14:03:49 +04:00
-
5c9d04cf3d
use file with links as arguments
Maks Snegov
2014-07-20 13:48:18 +04:00
-
514b39d287
use default charset utf-8 if not set in headers
Maks Snegov
2014-07-20 13:31:20 +04:00
-
45f30ca9de
fix: error with urls without scheme ('//ya.ru/index.html')
Maks Snegov
2014-07-20 13:30:22 +04:00
-
b58188b7b7
remove import
Maks Snegov
2014-07-20 13:29:56 +04:00
-
c523d025af
add duplicate checking
Maks Snegov
2014-07-20 13:06:51 +04:00
-
a0fbb414a7
write url in the beginning of the file
Maks Snegov
2014-07-20 12:17:01 +04:00
-
716c61f6f1
replace http.client with urllib
Maks Snegov
2014-07-20 08:09:07 +04:00
-
eb2c43f438
ignore UTF-8 errors
Maks Snegov
2014-06-25 08:38:43 +04:00
-
6a818f4bb4
fix: error with empty GET urls
Maks Snegov
2014-06-23 00:50:21 +04:00
-
594ff71991
add css embedding
Maks Snegov
2014-06-22 23:51:18 +04:00
-
754411b6b7
remove unused header from request
Maks Snegov
2014-06-22 22:57:42 +04:00
-
a7ef8a8b7b
separate complete_url function
Maks Snegov
2014-06-22 22:56:43 +04:00
-
35f755005d
fix: does not work with GET arguments
Maks Snegov
2014-06-22 13:12:35 +04:00
-
fe69eff79b
fix increment postfix in filenames
Maks Snegov
2014-06-22 12:38:05 +04:00
-
5c87f241d1
clean title from multiple whitespaces
Maks Snegov
2014-06-22 12:24:10 +04:00
-
ae63ca6318
skip connRefusedError pictures
Maks Snegov
2014-06-22 12:16:10 +04:00
-
36be68d78d
fix title with attributes parsing
Maks Snegov
2014-06-22 11:59:02 +04:00
-
ab03e18ce2
fix relative urls
Maks Snegov
2014-06-22 11:48:04 +04:00
-
5b91bef896
add infinite redirects blocking
Maks Snegov
2014-06-22 11:47:21 +04:00
-
11de357865
add image embedding
Maks Snegov
2014-06-22 11:45:37 +04:00
-
5837451ed7
add url as comment to saved pages
Maks Snegov
2014-06-21 20:23:25 +04:00
-
e2009e7f08
skip fname duplicates
Maks Snegov
2014-06-21 20:09:15 +04:00
-
ab9a7e34c1
get title name
Maks Snegov
2014-06-21 09:58:47 +04:00
-
aead01258d
remove never used if condition
Maks Snegov
2014-06-21 09:43:12 +04:00
-
ae4a9b986e
add gzip support
Maks Snegov
2014-06-17 22:31:02 +04:00
-
2666d7911a
no scheme in url fix
Maks Snegov
2014-06-17 22:28:54 +04:00
-
5b05f3e8d0
separate download_content() from get_page()
Maks Snegov
2014-06-17 22:26:12 +04:00
-
2f6c877493
fix: URL with no scheme raises an error
Maks Snegov
2014-06-15 20:16:35 +04:00
-
7e43162920
rewrite HTML title parser
Maks Snegov
2014-06-01 23:20:42 +04:00
-
af948ff6fc
move shell script to deprecated dir
Maks Snegov
2014-06-01 21:28:06 +04:00
-
6cbfec5067
set result file name by page title
Maks Snegov
2013-12-24 23:00:43 +04:00
-
fe61491292
add redirect support
Maks Snegov
2013-11-10 00:15:30 +04:00
-
67b7dc81e9
fix charset from response header
Maks Snegov
2013-11-09 22:39:35 +04:00
-
5818b0e096
determine charset from response header
Maks Snegov
2013-11-09 22:01:43 +04:00
-
36b407e86c
init nevernote, python version
Maks Snegov
2013-11-09 21:20:53 +04:00
-
c8fcdd6241
Fix bug: crash if result dir (notebook name or todo) doesn't exist
Maks Snegov
2012-10-07 21:19:07 +04:00
-
c08d3da905
Fix bug: if .nevernote doesn't exist in home dir, script will stop
Maks Snegov
2012-10-07 17:31:29 +04:00
-
64fa37f0af
Move config files to dot-dir in user home. Add notebook support. Do not create sub-directories.
Maks Snegov
2012-10-07 15:26:44 +04:00
-
1fa1606f0d
Initial commit.
Maks Snegov
2012-08-19 02:46:46 +04:00