TL;DR

I just released generate-sitemap 1.9.2, a GitHub Action for generating XML sitemaps for static websites. The generate-sitemap GitHub Action is implemented in Python, and generates an XML sitemap by crawling the GitHub repository containing the html of the site, using commit dates to generate <lastmod> tags in the sitemap.

This release, generate-sitemap 1.9.2, is primarily a fix for a minor bug in the regular expression used to detect whether a page has a meta robots noindex directive in the page head, and should thus be excluded from the sitemap. The bug was revealed by a warning message that Python 3.12 generates about the regular expression in question, which Python 3.11 and earlier do not warn about.
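To make the kind of check involved concrete, here is a minimal sketch of detecting a meta robots noindex directive in a page head. The pattern and the `has_noindex` helper are hypothetical illustrations, not the regex or code that generate-sitemap actually uses. The sketch also assumes the warning was of the invalid escape sequence kind: Python 3.12 reports a backslash pair such as `\s` in a non-raw string literal as a SyntaxWarning, where earlier versions issued a DeprecationWarning that is hidden by default. Writing the pattern as a raw string avoids that.

```python
import re

# Hypothetical pattern for illustration only (not the action's real regex).
# Raw strings keep \s and \b intact for the regex engine, so Python 3.12
# has no invalid escape sequence to warn about.
NOINDEX_META = re.compile(
    r"""<meta\s+[^>]*name\s*=\s*["']robots["']"""
    r"""[^>]*content\s*=\s*["'][^"']*\bnoindex\b""",
    re.IGNORECASE,
)

HEAD_END = re.compile(r"</head\s*>", re.IGNORECASE)


def has_noindex(html: str) -> bool:
    """Return True if the head of the page has a meta robots noindex directive."""
    # Only the text before </head> is inspected; the body is ignored.
    end = HEAD_END.search(html)
    head = html if end is None else html[: end.start()]
    return NOINDEX_META.search(head) is not None
```

This simplified pattern assumes the `name` attribute appears before `content`; a real implementation may need to handle other attribute orders.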



Changelog 1.9.2 – 2023-10-05



Fixed

  • Fixed minor bug in regex used to detect if a page has a meta robots noindex directive in head.



CI/CD

  • Bump Python to 3.12 in CI/CD workflows when running unit tests.



Dependencies

  • Bump cicirello/pyaction from 4.14.1 to 4.25.0, including upgrading Python within the Docker container to 3.12.



More Information

Please consider starring generate-sitemap’s GitHub repository:

Generate an XML sitemap for a GitHub Pages site using GitHub Actions

cicirello/generate-sitemap - Generate XML sitemaps for static websites in GitHub Actions

Take a look at all of our GitHub Actions: https://actions.cicirello.org/

The generate-sitemap GitHub action generates a sitemap for a website hosted on GitHub
Pages, and has the following features:

  • Support for both xml and txt sitemaps (you choose using one of the action’s inputs).
  • When generating an xml sitemap, it uses the last commit date of
    each file to generate the <lastmod> tag in the sitemap entry. If the file
    was created during that workflow run, but not yet committed, then it instead uses
    the current date (however, we recommend committing newly created files first if possible).
  • Supports URLs for html and pdf files in the sitemap, and has inputs
    to control the included file types (defaults include both html and pdf files in the sitemap).
  • Now also supports including URLs for a user specified list of
    additional file extensions in the sitemap.
  • …
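The <lastmod> behavior described in the list above can be sketched roughly as follows. This is an assumption-laden illustration, not the action's source: the helper names `last_modified` and `sitemap_entry` are hypothetical, and the use of `git log -1 --format=%cI` to read the last commit date is my guess at one reasonable way to do it.

```python
import subprocess
from datetime import datetime, timezone
from xml.sax.saxutils import escape


def last_modified(path: str) -> str:
    """Return the last commit date of a file, or the current date if uncommitted."""
    # %cI asks git for the committer date in strict ISO 8601 format.
    result = subprocess.run(
        ["git", "log", "-1", "--format=%cI", "--", path],
        capture_output=True,
        text=True,
        check=False,
    )
    committed = result.stdout.strip()
    if committed:
        return committed
    # No commit touches this file yet (e.g., created during the workflow run),
    # so fall back to the current date and time.
    return datetime.now(timezone.utc).isoformat(timespec="seconds")


def sitemap_entry(url: str, lastmod: str) -> str:
    """Format one <url> element of an XML sitemap (hypothetical helper)."""
    return (
        "<url>\n"
        f"<loc>{escape(url)}</loc>\n"
        f"<lastmod>{escape(lastmod)}</lastmod>\n"
        "</url>"
    )
```

Calling `sitemap_entry("https://example.com/index.html", last_modified("index.html"))` from inside a git repository would then produce one entry for that page.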

For more information, see my earlier post about generate-sitemap here on DEV, as well as its webpage.

The generate-sitemap GitHub action generates a sitemap for a website hosted on GitHub Pages. Supports both xml and txt sitemaps. Uses the last commit date of each file to generate the lastmod tags in XML sitemaps. Parses robots.txt and scans html files for noindex directives, excluding URLs if noindex directives or disallows are found.

actions.cicirello.org



Where You Can Find Me

Follow me here on DEV and on GitHub:

Or visit my website:

Vincent A. Cicirello – Professor of Computer Science at Stockton University – is a
researcher in artificial intelligence, evolutionary computation, swarm intelligence,
and computational intelligence, with a Ph.D. in Robotics from Carnegie Mellon
University. He is an ACM Senior Member, IEEE Senior Member, AAAI Life Member,
EAI Distinguished Member, and SIAM Member.

cicirello.org
