What is HTML Transcoding?
An updated version of this tutorial, based on the latest version of the website, is now available.
HTML transcoding is a data re-format step that converts encoded HTML characters (entities) into plain text, which makes the source code easier to read after you extract the HTML of a web page. For example, it transcodes “&gt;” into “>” and “&lt;” into “<”.
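To illustrate what this kind of conversion does, here is a minimal Python sketch. It assumes the transcoding step behaves like standard HTML entity decoding, which is an assumption rather than a description of how Octoparse implements it. The sample string is hypothetical, and `html.unescape` comes from Python's standard library, not from Octoparse.

```python
import html

# Hypothetical extracted value in which the markup is entity-encoded.
encoded = "&lt;div class=&quot;price&quot;&gt;Tom &amp; Jerry&lt;/div&gt;"

# html.unescape replaces named and numeric character references
# (e.g. &lt; &gt; &amp; &quot;) with the characters they stand for.
# Assumption: this approximates the effect of the transcoding step.
decoded = html.unescape(encoded)

print(encoded)
print(decoded)
```

After decoding, the value reads as ordinary HTML source, which is the readability benefit described above.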
To find this option, follow these steps:
Choose a data field ➜ Click “Customize Field” ➜ Click “Re-format extracted data” ➜ Click “Add steps” ➜ Choose “Html transcoding”.
The conversion is applied automatically once you click OK.
This function is used less often than other data re-format functions such as “Replace with Regular Expression”. The other data re-format functions can also help make your extracted data cleaner.
Author: The Octoparse Team