Frequently Asked Questions
How do wildcard filters work?
Use * (asterisk) as a wildcard that matches any sequence of characters. For example, /blog/* matches all URLs starting with /blog/, *.pdf matches all PDF files, and */category/* matches any URL containing /category/ anywhere in the path.
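As a rough illustration of how this kind of wildcard matching can be implemented, here is a minimal Python sketch. It is not the tool's actual code. The function names are hypothetical, and it assumes that patterns are compared against the path part of each URL and must match the whole path.

```python
import re
from urllib.parse import urlsplit


def wildcard_to_regex(pattern: str) -> "re.Pattern[str]":
    """Turn a wildcard pattern into a compiled regular expression.

    Every character is treated literally except *, which becomes ".*"
    (any sequence of characters, including an empty one).
    """
    literal_parts = [re.escape(part) for part in pattern.split("*")]
    return re.compile(".*".join(literal_parts))


def matches(pattern: str, url: str) -> bool:
    """Return True if the URL's path matches the wildcard pattern.

    Assumption: only the path is compared (not the scheme, host, or
    query string), and the pattern must cover the entire path.
    """
    path = urlsplit(url).path
    return wildcard_to_regex(pattern).fullmatch(path) is not None


if __name__ == "__main__":
    for pattern, url in [
        ("/blog/*", "https://example.com/blog/post-1"),
        ("*.pdf", "https://example.com/files/report.pdf"),
        ("*/category/*", "https://example.com/shop/category/shoes"),
        ("/blog/*", "https://example.com/about"),
    ]:
        print(pattern, url, matches(pattern, url))
```

Escaping the literal parts matters because characters such as . and ? are common in URLs and would otherwise be read as regular expression syntax.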
What's the maximum number of URLs I can extract?
You can extract up to 10,000 URLs in a single request. If your sitemap contains more URLs, use the "Max URLs" setting to control how many are returned. For very large sitemaps, consider using filters to narrow down the results.
Can I extract from a sitemap index?
Yes! The tool automatically detects sitemap index files (those with a sitemapindex root element) and recursively fetches all child sitemaps to extract their URLs. The total count includes URLs from all nested sitemaps.
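The recursion can be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the tool's implementation: extract_urls and local_name are hypothetical names, and fetch is a caller-supplied function that takes a sitemap URL and returns its XML as text.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List, Optional, Set


def local_name(tag: str) -> str:
    """Strip the XML namespace, e.g. '{http://...}loc' -> 'loc'."""
    return tag.rsplit("}", 1)[-1]


def extract_urls(
    sitemap_url: str,
    fetch: Callable[[str], str],
    seen: Optional[Set[str]] = None,
) -> List[str]:
    """Collect page URLs from a sitemap, following sitemap indexes.

    `fetch` is an assumed helper supplied by the caller. `seen` guards
    against fetching the same sitemap twice if indexes reference each other.
    """
    seen = set() if seen is None else seen
    if sitemap_url in seen:
        return []
    seen.add(sitemap_url)

    root = ET.fromstring(fetch(sitemap_url))
    # Both <url> and <sitemap> entries carry their address in a <loc> child.
    locs = [
        child.text.strip()
        for entry in root
        for child in entry
        if local_name(child.tag) == "loc" and child.text
    ]

    if local_name(root.tag) == "sitemapindex":
        # Each <loc> points at another sitemap, so recurse into it.
        urls: List[str] = []
        for child_sitemap in locs:
            urls.extend(extract_urls(child_sitemap, fetch, seen))
        return urls

    # A regular <urlset>: the <loc> values are the page URLs themselves.
    return locs
```

Passing fetch in as a parameter keeps the sketch independent of any particular HTTP library, and makes it easy to try out with a dictionary of sample XML documents.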
What format is the exported file?
You have two export options: CSV (includes URL and lastmod columns, can be opened in Excel) and TXT (one URL per line, simple format). You can also copy all URLs directly to your clipboard.
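To make the two formats concrete, here is a small Python sketch that produces equivalent output. It is only an approximation of the exported files: the function names and the exact header labels (url, lastmod) are assumptions, and a missing lastmod is written as an empty cell.

```python
import csv
import io
from typing import Iterable, Optional, Tuple

Row = Tuple[str, Optional[str]]  # (url, lastmod or None)


def to_csv(rows: Iterable[Row]) -> str:
    """Build CSV text with a header row and one row per URL."""
    buffer = io.StringIO(newline="")
    writer = csv.writer(buffer)
    writer.writerow(["url", "lastmod"])  # header labels are an assumption
    for url, lastmod in rows:
        writer.writerow([url, lastmod or ""])
    return buffer.getvalue()


def to_txt(rows: Iterable[Row]) -> str:
    """Build plain text with one URL per line and no other columns."""
    return "".join(f"{url}\n" for url, _ in rows)


if __name__ == "__main__":
    sample = [
        ("https://example.com/a", "2024-01-01"),
        ("https://example.com/b", None),
    ]
    print(to_csv(sample))
    print(to_txt(sample))
```

Using the csv module rather than joining strings by hand means URLs that contain commas or quotes are quoted correctly.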
How do include and exclude filters interact?
Include filters are applied first: if any are set, only URLs matching at least one include pattern are kept. Exclude filters are then applied to what remains, removing any URL that matches an exclude pattern. This lets you say "get all /blog/* URLs except *.pdf files".
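The order of operations can be sketched in Python like this. It is a minimal illustration rather than the tool's code: apply_filters and _matches are hypothetical names, and the sketch assumes patterns are compared against the whole path of each URL.

```python
import re
from typing import Iterable, List, Sequence
from urllib.parse import urlsplit


def _matches(pattern: str, url: str) -> bool:
    """Wildcard match where * stands for any sequence of characters.

    Assumption: the pattern is compared against the full URL path.
    """
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.fullmatch(regex, urlsplit(url).path) is not None


def apply_filters(
    urls: Iterable[str],
    include: Sequence[str] = (),
    exclude: Sequence[str] = (),
) -> List[str]:
    kept = []
    for url in urls:
        # Step 1: with include patterns set, keep only URLs matching one of them.
        if include and not any(_matches(p, url) for p in include):
            continue
        # Step 2: drop anything matching an exclude pattern.
        if any(_matches(p, url) for p in exclude):
            continue
        kept.append(url)
    return kept


if __name__ == "__main__":
    urls = [
        "https://example.com/blog/post",
        "https://example.com/blog/guide.pdf",
        "https://example.com/about",
    ]
    # "Get all /blog/* URLs except *.pdf files"
    print(apply_filters(urls, include=["/blog/*"], exclude=["*.pdf"]))
```

With no include patterns, step 1 is skipped entirely, so every URL passes through to the exclude check.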