[ESP-ENG] Titulos y enlaces en developer-tech || Titles and links in developer-tech

0 comments

pynomiems3 K3 years agoPeakD


Imagen diseñada con canva || Image designed with canva

import httpx
from selectolax.parser import HTMLParser

headers={'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.141 Safari/537.36 Edg/87.0.664.75'}

url_list=['https://www.developer-tech.com/categories/developer-ai/',
'https://www.developer-tech.com/categories/developer-databases/',
'https://www.developer-tech.com/categories/development-tools/',
'https://www.developer-tech.com/categories/developer-hacking-security/',
'https://www.developer-tech.com/categories/developer-platforms/']

for url_list in url_list:

client=httpx.Client(headers=headers)
developer_tech=client.get(url_list).text

with open('developer_tech.html',mode='w',encoding="utf-8") as archive:

    archive.write(developer_tech)
    archive.close()

    f=open('developer_tech.html',encoding="utf-8")

    local_html=HTMLParser(f.read())

for parsing in local_html.css('header.article-header > h3 > a'):
    
    headlines=parsing.text()
    links=parsing.attributes['href']
    
    print(f'headlines:{headlines} links:{links}')

Comments

Sort byBest