

How can I extract information from this source code? I want to extract the name, address, and course from this link

qq_笑_17 2022-12-27 15:44:51
I want to extract the name, address, course, and institute type from this markup. I suspect the table is what is getting in my way. Every time I try, I get an empty list, and I don't know what to do.

<div class="row">
  <div class="col-md-12">
    <div class="panel panel-default">
      <div class="panel-body ">
        <div class="row">
          <div id="ContentPlaceHolder1_pnldefault">
            <table id="ContentPlaceHolder1_dlstCollege" class="table table-bordered table responsive" cellspacing="0" style="border-collapse:collapse;">
              <tr>
                <td>
                  <input type="hidden" name="ctl00$ContentPlaceHolder1$dlstCollege$ctl00$hdnInstituteId" id="ContentPlaceHolder1_dlstCollege_hdnInstituteId_0" value="968  " />
                  <a id="ContentPlaceHolder1_dlstCollege_hlpkInstituteName_0" href="CollegeDetailedInformation.aspx?Inst=968  ">**A R INSTITUTE OF PHARMACY , BIJNOR (968)**</a>
                  <br />
                  <b>Location:</b>
                  <span id="ContentPlaceHolder1_dlstCollege_lblAddress_0">**TAJPUR** </span>
                  <br />
                  <b>Course:</b>
                  <span id="ContentPlaceHolder1_dlstCollege_lblCourse_0">**B.Pharm**,</span>
                  <br />
                  <b>Category:</b>
                  <span id="ContentPlaceHolder1_dlstCollege_lblInstituteType_0">**Private**</span>
                  <br />
                  <b>Web Address:</b>
                  <a id="lnkBtnWebURL" href='' target="_blank"></a>
                  <br />
                </td>
              </tr>

This is what I tried:

import requests
from bs4 import BeautifulSoup

res = requests.get('http://kyc.aktu.ac.in/')
soup = BeautifulSoup(res.content, 'html.parser')
weblinks = soup.find_all('a', attrs = {'id':'ContentPlaceHolder1_dlstCollege_hlpkInstituteName_0'})
pagelinks = []
for link in weblinks:
    link = link.find('a')
    pagelinks.append(link.get('href'))

1 Answer

慕妹3242003


Try this:


from bs4 import BeautifulSoup as bs

# Markup from the question. Triple quotes keep the inner href='' intact,
# which a single-quoted literal would silently split into two strings.
html = """<div class="row"><div class="col-md-12"><div class="panel panel-default"><div class="panel-body "><div class="row"><div id="ContentPlaceHolder1_pnldefault"><table id="ContentPlaceHolder1_dlstCollege" class="table table-bordered table responsive" cellspacing="0" style="border-collapse:collapse;"><tr><td><input type="hidden" name="ctl00$ContentPlaceHolder1$dlstCollege$ctl00$hdnInstituteId" id="ContentPlaceHolder1_dlstCollege_hdnInstituteId_0" value="968  " /><a id="ContentPlaceHolder1_dlstCollege_hlpkInstituteName_0" href="CollegeDetailedInformation.aspx?Inst=968  ">**A R INSTITUTE OF PHARMACY , BIJNOR (968)**</a><br /><b>Location:</b><span id="ContentPlaceHolder1_dlstCollege_lblAddress_0">**TAJPUR** </span><br /><b>Course:</b><span id="ContentPlaceHolder1_dlstCollege_lblCourse_0">**B.Pharm**,</span><br /><b>Category:</b><span id="ContentPlaceHolder1_dlstCollege_lblInstituteType_0">**Private**</span><br /><b>Web Address:</b><a id="lnkBtnWebURL" href='' target="_blank"></a><br /></td></tr>"""

# The 'lxml' parser requires the lxml package to be installed.
soup = bs(html, 'lxml')

# Each field has its own id, so look it up directly and strip surrounding whitespace.
name = soup.find('a', id='ContentPlaceHolder1_dlstCollege_hlpkInstituteName_0').text.strip()
address = soup.find('span', id='ContentPlaceHolder1_dlstCollege_lblAddress_0').text.strip()
course = soup.find('span', id='ContentPlaceHolder1_dlstCollege_lblCourse_0').text.strip()
institute_type = soup.find('span', id='ContentPlaceHolder1_dlstCollege_lblInstituteType_0').text.strip()

print(name)
print(address)
print(course)
print(institute_type)

Output:


**A R INSTITUTE OF PHARMACY , BIJNOR (968)**

**TAJPUR**

**B.Pharm**,

**Private**
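
The lookups above only cover the row whose ids end in `_0`. If the real table holds several institutes, something like the following might work. It is only a sketch: it assumes that each further row repeats the same ids with an increasing suffix (`_1`, `_2`, ...), which the question does not confirm. The second row in `SAMPLE_HTML`, the `extract_institutes` function, and the `PREFIX` constant are made up for illustration. It also reads `href` straight from each matched link, because `Tag.find` only searches descendants: calling `find('a')` on an element that is already an `<a>` returns `None`.

```python
from bs4 import BeautifulSoup

# Trimmed sample markup. Row 0 follows the question; row 1 is hypothetical.
SAMPLE_HTML = """
<table id="ContentPlaceHolder1_dlstCollege">
  <tr><td>
    <a id="ContentPlaceHolder1_dlstCollege_hlpkInstituteName_0" href="CollegeDetailedInformation.aspx?Inst=968  ">A R INSTITUTE OF PHARMACY , BIJNOR (968)</a>
    <span id="ContentPlaceHolder1_dlstCollege_lblAddress_0">TAJPUR </span>
    <span id="ContentPlaceHolder1_dlstCollege_lblCourse_0">B.Pharm,</span>
    <span id="ContentPlaceHolder1_dlstCollege_lblInstituteType_0">Private</span>
  </td></tr>
  <tr><td>
    <a id="ContentPlaceHolder1_dlstCollege_hlpkInstituteName_1" href="CollegeDetailedInformation.aspx?Inst=000">EXAMPLE COLLEGE (000)</a>
    <span id="ContentPlaceHolder1_dlstCollege_lblAddress_1">EXAMPLE TOWN</span>
    <span id="ContentPlaceHolder1_dlstCollege_lblCourse_1">B.Tech,</span>
    <span id="ContentPlaceHolder1_dlstCollege_lblInstituteType_1">Government</span>
  </td></tr>
</table>
"""

# Shared id prefix seen in the question's markup.
PREFIX = "ContentPlaceHolder1_dlstCollege_"


def _field(cell, field):
    """Return the stripped text of the span whose id starts with PREFIX + field."""
    node = cell.select_one(f'span[id^="{PREFIX}{field}_"]')
    return node.get_text(strip=True) if node else None


def extract_institutes(html):
    """Collect one dict per institute link found in the markup."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    # Match every institute link regardless of its numeric suffix.
    for link in soup.select(f'a[id^="{PREFIX}hlpkInstituteName_"]'):
        # The related spans live in the same table cell as the link.
        cell = link.find_parent("td") or soup
        course = _field(cell, "lblCourse")
        rows.append({
            "name": link.get_text(strip=True),
            "href": link.get("href", "").strip(),
            "address": _field(cell, "lblAddress"),
            # The course text carries a trailing comma in the source markup.
            "course": course.rstrip(",") if course else None,
            "institute_type": _field(cell, "lblInstituteType"),
        })
    return rows


if __name__ == "__main__":
    for row in extract_institutes(SAMPLE_HTML):
        print(row)
```

If the list still comes back empty against the live page, it may be worth printing `res.text` and checking whether the table is present in the response at all, since the sample markup may only appear after a search form is submitted.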


Answered 2022-12-27