使用Python在Selenium WebDriver中获取WebElement的HTML源代码

2022-07-11 17:13:16玩技站长

玩技站长

管理员, Keymaster

关注

11178
文章

0
粉丝

测试交流18421字数 134阅读0分26秒阅读模式

我正在使用Python绑定来运行Selenium WebDriver：

from selenium import webdriver
wd = webdriver.Firefox()

我知道我可以抓取这样的Web元素：文章源自玩技e族-https://www.playezu.com/179020.html

elem = wd.find_element_by_css_selector('#my-id')

我知道我可以得到完整的来源与。。。文章源自玩技e族-https://www.playezu.com/179020.html

wd.page_source

但是有没有办法获得“元素源”？文章源自玩技e族-https://www.playezu.com/179020.html

elem.source   # <-- returns the HTML as a string

用于Python的Selenium WebDriver文档基本上不存在，我在代码中没有看到任何支持该功能的内容。文章源自玩技e族-https://www.playezu.com/179020.html

访问元素（及其子元素）的HTML的最佳方式是什么？文章源自玩技e族-https://www.playezu.com/179020.html 文章源自玩技e族-https://www.playezu.com/179020.html

版权提示：非本站文章仅供存储任何法律责任由作者承担▷诈骗举报◁▷新闻不符◁▷我要投稿◁
免责声明：部分内容来自用户上传发布或新闻客户端自媒体如有侵权请反馈站长处理
原创转载：阅读转载说明>>> https://www.playezu.com/179020.html

评论 18 访客 18

Dima Tisnek 9
2022-07-11 17:06:11 未知地区 18F
回复
WebElement element = driver.findElement(By.id("foo"));
String contents = (String)((JavascriptExecutor)driver).executeScript("return arguments[0].innerHTML;", element);
This code really works to get JavaScript from source as well!
wowandy 9
2022-07-11 17:06:11 未知地区 17F
回复
In PHP Selenium WebDriver you can get page source like this:
$html = $driver->getPageSource();
Or get HTML of the element like this:
// innerHTML if you need HTML of the element content
$html = $element->getDomProperty(‘outerHTML’);
christian 9
2022-07-11 17:06:11 未知地区 16F
回复
In current versions of php-webdriver (1.12.0+) you to use
$element->getDomProperty(‘innerHTML’);
as pointed out in this issue: https://github.com/php-webdriver/php-webdriver/issues/929
user2849367 9
2022-07-11 17:06:10 未知地区 15F
回复
Use execute_script get html
bs4(BeautifulSoup) also can access html tag quickly.
from bs4 import BeautifulSoup
html = adriver.execute_script("return document.documentElement.outerHTML")
bs4_onepage_object=BeautifulSoup(html,"html.parser")
bs4_div_object=bs4_onepage_object.find_all("atag",class_="attribute")
Peter Mortensen 9
2022-07-11 17:06:10 未知地区 14F
回复
And in PHPUnit Selenium test it’s like this:
$text = $this->byCssSelector(‘.some-class-nmae’)->attribute(‘innerHTML’);
Peter Mortensen 9
2022-07-11 17:06:10 未知地区 13F
回复
If you are interested in a solution for Selenium Remote Control in Python, here is how to get innerHTML:
innerHTML = sel.get_eval("window.document.getElementById(‘prodid’).innerHTML")
Peter Mortensen 9
2022-07-11 17:06:10 未知地区 12F
回复
The method to get the rendered HTML I prefer is the following:
driver.get("http://www.google.com")
body_html = driver.find_element_by_xpath("/html/body")
print body_html.text
However, the above method removes all the tags (yes, the nested tags as well) and returns only text content. If you interested in getting the HTML markup as well, then use the method below.
print body_html.getAttribute("innerHTML")
MaartenDev 9
2022-07-11 17:06:10 未知地区 11F
回复
This works seamlessly for me.
element.get_attribute(‘innerHTML’)

测试交流

测试分享

百科知识

经验总结