Scrubber.py - If a malformed message has Content-Type: with a

charset="quoted-printable" parameter, quopri_encode can be called at t = t.encode(charset, 'replace') which throws an AssertionError. See log message for r. 8213. Caught this. - Malformed RFC 2047 encoded filename= parameter can have a null byte or other garbage in the extension. Cleaned this. - A message with a message/delivery-status part returns None for the part's payload. Checked for a payload before attempting unicode/encode.
author: msapiro <> 2007-06-15 22:22:33 +0000
committer: msapiro <> 2007-06-15 22:22:33 +0000
commit: 04b442cce3d2a853318ebebd13449da3a7cd8c96 (patch)
tree: cd5e4e2ec2f2546a000eafcb48c3dd34653457b6
parent: b2e780a1de9c2263b56ce77b280989fd22bff3cc (diff)
download: mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.tar.gz
mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.tar.xz
mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.zip
1 files changed, 7 insertions, 4 deletions
diff --git a/Mailman/Handlers/Scrubber.py b/Mailman/Handlers/Scrubber.py
index 1ae849b1..de02fd45 100644
--- a/Mailman/Handlers/Scrubber.py
+++ b/Mailman/Handlers/Scrubber.py
@@ -373,7 +373,9 @@ Url : %(url)s
                 partcharset = str(partcharset)
             else:
                 partcharset = part.get_content_charset()
-            if partcharset and partcharset <> charset:
+            # If the part is Content-Type: message/delivery-status, payload is
+            # None so test here.
+            if t and partcharset and partcharset <> charset:
                 try:
                     t = unicode(t, partcharset, 'replace')
                 except (UnicodeError, LookupError, ValueError, AssertionError):
@@ -385,7 +387,7 @@ Url : %(url)s
                 try:
                     # Should use HTML-Escape, or try generalizing to UTF-8
                     t = t.encode(charset, 'replace')
-                except (UnicodeError, LookupError, ValueError):
+                except (UnicodeError, LookupError, ValueError, AssertionError):
                     t = t.encode(lcset, 'replace')
             # Separation is useful
             if isinstance(t, StringType):
@@ -436,7 +438,7 @@ def save_attachment(mlist, msg, dir, filter_html=True):
     # i18n file name is encoded
     lcset = Utils.GetCharSet(mlist.preferred_language)
     filename = Utils.oneline(msg.get_filename(''), lcset)
-    fnext = os.path.splitext(filename)[1]
+    filename, fnext = os.path.splitext(filename)
     # For safety, we should confirm this is valid ext for content-type
     # but we can use fnext if we introduce fnext filtering
     if mm_cfg.SCRUBBER_USE_ATTACHMENT_FILENAME_EXTENSION:
@@ -444,6 +446,8 @@ def save_attachment(mlist, msg, dir, filter_html=True):
         ext = fnext or guess_extension(ctype, fnext)
     else:
         ext = guess_extension(ctype, fnext)
+    # Allow only alphanumerics, dash, underscore, and dot
+    ext = sre.sub('', ext)
     if not ext:
         # We don't know what it is, so assume it's just a shapeless
         # application/octet-stream, unless the Content-Type: is
@@ -461,7 +465,6 @@ def save_attachment(mlist, msg, dir, filter_html=True):
     try:
         # Now base the filename on what's in the attachment, uniquifying it if
         # necessary.
-        filename = msg.get_filename()
         if not filename or mm_cfg.SCRUBBER_DONT_USE_ATTACHMENT_FILENAME:
             filebase = 'attachment'
         else:
author	msapiro <>	2007-06-15 22:22:33 +0000
committer	msapiro <>	2007-06-15 22:22:33 +0000
commit	04b442cce3d2a853318ebebd13449da3a7cd8c96 (patch)
tree	cd5e4e2ec2f2546a000eafcb48c3dd34653457b6
parent	b2e780a1de9c2263b56ce77b280989fd22bff3cc (diff)
download	mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.tar.gz mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.tar.xz mailman2-04b442cce3d2a853318ebebd13449da3a7cd8c96.zip