下面的代码成功地遍历DOM中的每个元素,并将每个元素放入Excel表中。(tagName、ID、className等)
我的问题是:
我如何刮标签装饰品(标题,href等)每个元素?具体来说,对于"A“标记,如何刮除"href”属性?
Enum READYSTATE
READYSTATE_UNINITIALIZED = 0
READYSTATE_LOADING = 1
READYSTATE_LOADED = 2
READYSTATE_INTERACTIVE = 3
READYSTATE_COMPLETE = 4
End Enum
Dim ie As InternetExplorer
Dim html As HTMLDocument
Dim RowNumber As Integer
Set ie = New InternetExplorer
ie.Visible = False
ie.navigate "www.somesite.com"
Do While ie.READYSTATE <> READYSTATE_COMPLETE
Application.StatusBar = "Connecting..."
DoEvents
Loop
Set html = ie.document
RowNumber = 1
For Each element In html.all
Cells(RowNumber, "A").Value = element.tagName
Cells(RowNumber, "B").Value = element.ID
Cells(RowNumber, "C").Value = element.className
Cells(RowNumber, "D").Value = element.innerHTML
RowNumber = RowNumber + 1
Next element
任何帮助都将不胜感激。
发布于 2016-07-04 04:10:03
在RowNumber = RowNumber + 1
之前添加这一行
If (element.tagName = "A") Then Cells(RowNumber, "E").Value=element.getAttribute("href")
https://stackoverflow.com/questions/38173327
复制相似问题