<img alt="" src="lib/header1.jpg" style="padding : 1px;"/> = dictionary of attribute = {'alt': '', 'style': 'padding : 1px;', 'src': 'lib/header1.jpg'}
<a> fdsjfdsjf;lkjdsf </a> = <tag attribute1="xxxxx" attr2="yyyy">text</tag>
code for x in soup.find_all('img') : print(x) print(x['src']) x['src'] = x.get('src').strip('lib/') print(x['src']) print(x.attrs) | Output C:\Users\jkriker\Google Drive\FTUwebsite\O0O000OOO00O\migrate framed to unframed\cleaned <img alt="" src="lib/header1.jpg" style="padding : 1px;"/> lib/header1.jpg header1.jpg {'alt': '', 'style': 'padding : 1px;', 'src': 'header1.jpg'} <img alt="" src="lib/NewItem460.png" style="padding : 1px;"/> lib/NewItem460.png NewItem460.png {'alt': '', 'style': 'padding : 1px;', 'src': 'NewItem460.png'} |
---|
Regular expressions for python | |
---|---|
\/\/f.*uk\\ | |
<ri.*\"\s\/> | |
<a.*\"\s/> | |