Extract text content in HTML under VC! Wait online! anxious! A lot of pointers!

kingfrog · Post time: 2020-3-12 23:30:01

How to extract text content in HTML using webbrowers control?

Example:
html file: <TD class = text_b_12_1 style = "PADDING-LEFT: 30px" colSpan = 2
 height = 30> Volume 1 The Six Houses Episode 1 </ TD> </ TR>
 <TR>
 <TD class = text_o_12_2 align = middle colSpan = 2 height = 50> 
 Legend of the Seven Realms </ TD> </ TR>
 <TR>
 <TD class = text_b_14_1 colSpan = 2>
 China has beautiful mountains and rivers. For thousands of years, countless magic legends have been circulating on this land. Since ancient times, people have always talked about those legends about immortality and immortality. Since ancient times, mortals have all died. But everyone in the world loves life and death, and Yan Luozhi of the prefecture said that it added a little bit of fear. Below this, there is a saying that immortality is immortal, which makes people dream of life. (From The Sword Book Alliance) 

Extract it into
Seven Realms
China has beautiful mountains and rivers. For thousands of years, countless magic legends have been circulating on this land. Since ancient times, people have always talked about those legends about immortality and immortality. Since ancient times, mortals have all died. But everyone in the world loves life and death, and Yan Luozhi of the prefecture said that it added a little bit of fear. Below this, there is a saying that immortality is immortal, which makes people dream of life.

Thank you! very urgent.

okokok3030 · Post time: 2020-6-10 19:00:01

No need to use webbrowers, just use wininet, then use regular

siegfried008 · Post time: 2020-7-2 19:00:01

IHTMLElement::innerText

Or as the upstairs said.

1231456 · Post time: 2020-7-7 14:00:01

If i use lex

ayasakura · Post time: 2020-8-1 21:15:01

Parse yourself
<>***<> Just stay in between

牛罗锅 · Post time: 2020-8-8 14:45:01

I also agree with the analysis upstairs by myself, CString find will come out in a few clicks,

yangshuang · Post time: 2020-8-13 11:30:01

Parse it yourself, I wrote a similar before, not difficult

wintermaul · Post time: 2020-8-21 05:45:01

Using DOM to fetch text directly should be a relatively lightweight solution

hehehehe · Post time: 2020-8-22 16:45:01

const string ExtractHTML( const string&strHTML)
{
string strTemp = strHTML;
while( true)
{
size_t szPos = strTemp.find( "<" );
if( string::npos == szPos)
return strTemp;
size_t szEnd = strTemp.find( ">", szPos );
if( string::npos == szEnd)
return strTemp;
strTemp.erase( szPos, szEnd-szPos + 1 );
}
}

深蓝旅者 · Post time: 2020-8-22 19:30:01

study the walkall sample in MSDN
http://msdn.microsoft.com/archive/default.asp?url=/archive/en-us/samples/internet/browser/walkall/default.asp

		Remember me	Forgot password?
Password			Register