Text Processing Tools

Free online tools for document-to-Markdown extraction (PDF, DOCX, XLSX & 76+ formats), regex testing, text diffing, Markdown preview, and Lorem Ipsum generation. Live highlighting, side-by-side comparison, and HTML export. Works offline.

About Text Processing Tools

Text is the universal interface of software engineering, from documents that need converting to Markdown for RAG pipelines and documentation, to regular expressions that parse log files, to Markdown that documents APIs. These tools give you instant feedback: document-to-Markdown extraction from PDF, Word, Excel, PowerPoint and 76+ formats with heading and table detection, live regex highlighting with capture groups, side-by-side text comparison with line/word/character granularity, a Markdown preview that mirrors GitHub rendering, and a Lorem Ipsum generator for rapid prototyping. No signup, no upload, everything runs in your browser.

Frequently Asked Questions

Does the regex tester support all JavaScript regex features?
Yes. The Regex Tester uses your browser's native RegExp engine, so it supports all ES2024 features including lookbehind assertions, named capture groups, the dotAll flag (s), and Unicode property escapes (\p{...}).
Can I compare two files side by side?
Yes. The Diff Checker lets you paste or type text into two panels and instantly highlights additions, deletions, and modifications. You can switch between line-level, word-level, and character-level comparison modes.
Does the Markdown preview support GitHub-Flavored Markdown?
The Markdown Preview supports standard CommonMark syntax plus extensions like tables, task lists, and fenced code blocks with syntax highlighting. Output is sanitized with DOMPurify to prevent XSS.
How do I generate placeholder text for my designs?
The Lorem Ipsum Generator lets you choose the number of paragraphs, sentences, or words to generate. Copy the output with one click and paste it into your mockups, wireframes, or templates.
Can I use regex patterns from other languages like Python or Go?
Most regex syntax is shared across languages, but there are differences. The tester uses JavaScript's regex engine, so Python-specific features like (?P) named groups should be written as (?) instead. PCRE features like recursive patterns are not supported.