HOW TO strip HTML tags and show just web page text

Using C# & Regular Expressions, it can be done like this -

string stripped = System.Text.RegularExpressions.Regex.Replace(text,@"<(.|\n)*?>",string.Empty);

The lean, mean & Swiss knife of an editor, Editplus has a short cut for it - Ctrl + Shift + P or from the menu:
Edit > HTML > Strip HTML tags

Also see: HOW TO convert HTML content to plain text with Excel

Comments

  1. EditPlus rocks!
    Used it for 8 years and counting....
    Nice RegEx - thanks

    ReplyDelete

Post a Comment

Popular posts from this blog

Maven Crash Course - Learn Power Query, Power Pivot & DAX in 15 Minutes

"Data Prep & Exploratory Data Analysis" course by Maven Analytics

Oracle Cloud Infrastructure 2024 Generative AI Professional Course & Certification Exam (1Z0-1127-24)