import re from common_p3 import download def crawl_sitemap(url): sitemap = download(url) links = re.findall('<loc>(.*?)</loc>',sitemap) print('links=',links) for link in links: print('link=',link) html = download(link) return crawl_sitemap('ht…
#!/usr/python3 import re import urllib.request def gethtml(url): page=urllib.request.urlopen(url) html=page.read() return html def getimg(html): reg = r'src="(.*?\.jpg)"' img=re.compile(reg) html=html.decode('utf-8') # python3 imglist=re.findall…
一劳永逸解决:TypeError: cannot use a string pattern on a bytes-like object TypeError: cannot use a string pattern on a bytes-like object python2和python3之间切换,难免会碰到一些问题,有些方法比如re模块的findall要求传入的是字符串格式的参数,urllib.request.urlopen(url).read()返回的是bytes类型(这个是python3…
response.read() returns an instance of bytes while StringIO is an in-memory stream for text only. Use BytesIO instead. The StringIO and cStringIO modules are gone. Instead, import the io module and use io.StringIO or io.BytesIO for text and data resp…
用pycurl请求指定链接并返回结果时出现 TypeError: string argument expected, got 'bytes'  错误 经过排查问题出现在使用StringIO的write方法上,用BytesIO替代StringIO即可解决问题,代码如下:…
Symbols of String Pattern Matching in Introduction to Algorithms. As it's important to be clear when discussing the problem of string matching, we can use the meticulous symbols used in Introduction to Algorithms. Text: $T[1, ..., n]$. Pattern: $P[1,…
Python 出现 can't use a string pattern on a bytes-like object 学习了:https://www.cnblogs.com/andrewleeeeee/p/6444906.html data = data.decode('utf-8')…
preg_match -- 进行正则表达式匹配.并且只匹配一次,注意与preg_match_all区别. int preg_match( string pattern, string subject [, array matches [, int flags]] ) 在 subject 字符串中搜索与pattern给出的正则表达式相匹配的内容. 如果提供了 matches,则其会被搜索的结果所填充.$matches[0] 将包含与整个模式匹配的文本,$matches[1] 将包含与第一个捕获的括…
#########sample########## sqlite3.OperationalError: Could not decode to UTF-8 column 'logtype' with text 将 with connection.cursor() as c: c.execute("select id,name from district_info where p_id=0") provinces = c.fetchall() 调整为 con = sqlite3.conn…
关于TypeError: strptime() argument 1 must be str, not bytes解析   在使用datetime.strptime(s,fmt)来输出结果日期结果时,出现错误 TypeError: strptime() argument 1 must be str, not bytes 我的源代码如下 def datestr2num(s):return datetime.strptime(s, "%d-%m-%Y").date().weekday()…