Commit Graph

  • fe6c0c74a7 Merge branch 'bugfix/2-fail-on-unavailable-resource' of snegov/nevernote into master master snegov 2019-11-09 14:40:52 +0000
  • 6f917578aa Fix failing on unavailable page resource Maks Snegov 2019-11-09 17:33:58 +0300
  • e6db3f9d1b Fix newlines inside div tag Maks Snegov 2019-10-22 16:45:40 +0300
  • 3b6df3417a Fix link tag with missing rel attribute Maks Snegov 2019-10-22 16:45:13 +0300
  • e843abbc41 Fix python env string Maks Snegov 2019-10-22 16:44:27 +0300
  • 89a8dd90cc Use BS4 for HTML parsing Maks Snegov 2019-10-22 16:05:29 +0300
  • 3198361266 Add --skip-dups option Maks Snegov 2019-10-22 14:39:36 +0300
  • bdceede4f2 Rework fetching URLs from the file Maks Snegov 2019-10-22 12:15:31 +0300
  • 91cddfab7c Refactor code Maks Snegov 2019-10-22 11:43:54 +0300
  • 44b8a17841 Use requests library Maks Snegov 2019-10-22 11:33:06 +0300
  • 56a7032b3e Merge branch 'fix_htmlparser_strict' Maks Snegov 2016-03-10 19:21:48 +0300
  • 26e7176222 strict argument in html.parser.HTMLParser is removed since 3.5 Maks Snegov 2016-03-10 19:15:03 +0300
  • edd12deb37 Merge branch 'devel' Maks Snegov 2016-02-04 09:10:56 +0300
  • 1a6a7b3c9b Merge branch 'b64script' into devel Maks Snegov 2014-10-04 11:08:41 -0400
  • 23f648e1ad limit filename length with 128 chars plus extension Maks Snegov 2014-10-04 10:59:32 -0400
  • c1724b5921 use base64 encoding for embedded scripts Maks Snegov 2014-10-04 03:38:34 +0400
  • 6b3aa602ef add script embedding Maks Snegov 2014-10-04 03:24:38 +0400
  • cf626546e7 use set of content-types for checking Maks Snegov 2014-07-23 08:45:12 +0400
  • fbf52e9544 add script parsing Maks Snegov 2014-07-21 00:46:30 +0400
  • 7ce2bfb97f fix urllib.error.HTTPError print Maks Snegov 2014-07-20 21:42:13 +0400
  • 41e984e1f0 fix urllib.error.HTTPError calls Maks Snegov 2014-07-20 21:40:14 +0400
  • fb3870e9dd skip http error pages Maks Snegov 2014-07-20 17:31:43 +0400
  • 09346f4a70 fix: error with css charsets if no base charset Maks Snegov 2014-07-20 17:31:15 +0400
  • 61d3d84a9c remove unused exception Maks Snegov 2014-07-20 17:30:48 +0400
  • b5ddae0ef8 fix css charset error, add urllib.error.httperror Maks Snegov 2014-07-20 17:04:56 +0400
  • 964e79f97b add gzip encoding support Maks Snegov 2014-07-20 14:03:49 +0400
  • 5c9d04cf3d use file with links as arguments Maks Snegov 2014-07-20 13:48:18 +0400
  • 514b39d287 use default charset utf-8 if not set in headers Maks Snegov 2014-07-20 13:31:20 +0400
  • 45f30ca9de fix: error with urls without scheme ('//ya.ru/index.html') Maks Snegov 2014-07-20 13:30:22 +0400
  • b58188b7b7 remove import Maks Snegov 2014-07-20 13:29:56 +0400
  • c523d025af add duplicate checking Maks Snegov 2014-07-20 13:06:51 +0400
  • a0fbb414a7 write url in the beginning of the file Maks Snegov 2014-07-20 12:17:01 +0400
  • 716c61f6f1 replace http.client with urllib Maks Snegov 2014-07-20 08:09:07 +0400
  • eb2c43f438 ignore UTF-8 errors Maks Snegov 2014-06-25 08:38:43 +0400
  • 6a818f4bb4 fix: error with empty GET urls Maks Snegov 2014-06-23 00:50:21 +0400
  • 594ff71991 add css embedding Maks Snegov 2014-06-22 23:51:18 +0400
  • 754411b6b7 remove unused header from request Maks Snegov 2014-06-22 22:57:42 +0400
  • a7ef8a8b7b separate complete_url function Maks Snegov 2014-06-22 22:56:43 +0400
  • 35f755005d fix: do not work with GET arguments Maks Snegov 2014-06-22 13:12:35 +0400
  • fe69eff79b fix increment postfix in filenames Maks Snegov 2014-06-22 12:38:05 +0400
  • 5c87f241d1 clean title from multiple whitespaces Maks Snegov 2014-06-22 12:24:10 +0400
  • ae63ca6318 skip connRefusedError pictures Maks Snegov 2014-06-22 12:16:10 +0400
  • 36be68d78d fix title with attributes parsing Maks Snegov 2014-06-22 11:59:02 +0400
  • ab03e18ce2 fix relative urls Maks Snegov 2014-06-22 11:48:04 +0400
  • 5b91bef896 add infinite redirects blocking Maks Snegov 2014-06-22 11:47:21 +0400
  • 11de357865 add image embedding Maks Snegov 2014-06-22 11:45:37 +0400
  • 5837451ed7 add url as comment to saved pages Maks Snegov 2014-06-21 20:23:25 +0400
  • e2009e7f08 skip fname duplicates Maks Snegov 2014-06-21 20:09:15 +0400
  • ab9a7e34c1 get title name Maks Snegov 2014-06-21 09:58:47 +0400
  • aead01258d remove never used if condition Maks Snegov 2014-06-21 09:43:12 +0400
  • ae4a9b986e add gzip support Maks Snegov 2014-06-17 22:31:02 +0400
  • 2666d7911a no scheme in url fix Maks Snegov 2014-06-17 22:28:54 +0400
  • 5b05f3e8d0 separate download_content() from get_page() Maks Snegov 2014-06-17 22:26:12 +0400
  • 2f6c877493 fix: URL with no schema will raise error Maks Snegov 2014-06-15 20:16:35 +0400
  • 7e43162920 rewrite HTML title parser Maks Snegov 2014-06-01 23:20:42 +0400
  • af948ff6fc move shell script to deprecated dir Maks Snegov 2014-06-01 21:28:06 +0400
  • 6cbfec5067 set result file name by page title Maks Snegov 2013-12-24 23:00:43 +0400
  • fe61491292 add redirect support Maks Snegov 2013-11-10 00:15:30 +0400
  • 67b7dc81e9 fix charset from response header Maks Snegov 2013-11-09 22:39:35 +0400
  • 5818b0e096 determine charset from response header Maks Snegov 2013-11-09 22:01:43 +0400
  • 36b407e86c init nevernote, python version Maks Snegov 2013-11-09 21:20:53 +0400
  • c8fcdd6241 Fix bug: if result dir (notebook name or todo) doesn't exist, crash/ Maks Snegov 2012-10-07 21:19:07 +0400
  • c08d3da905 Fix bug: if .nevernote doesn't exist in home dir, script will stop Maks Snegov 2012-10-07 17:31:29 +0400
  • 64fa37f0af Move config files to dot-dir in userhome. Add notebook supports. Do not create sub-directories. Maks Snegov 2012-10-07 15:26:44 +0400
  • 1fa1606f0d Initial commit. Maks Snegov 2012-08-19 02:46:46 +0400