I want to find examples of other Hugo blogs, but they're not really easy to search for. Unless someone put "Hugo" in the descrption (which is actually common) there's no real defining files. However there's a set of files that are in a lot of Hugo Blogs and we can search them in Github with the GHArcive BigQuery Export.
The strategy is that most Hugo blogs will contain a
/themes folder, a
/content folder a
/static folder and a
They don't have to have these, but many of them will as these are very standard structure.
We can then search for them in Bigquery (this is just a 1% sample, to get them all replace
select repo_name, markdown_files from ( SELECT repo_name, logical_or(REGEXP_CONTAINS(path, '^themes/')) as has_theme, logical_or(REGEXP_CONTAINS(path, '^content/')) as has_content, logical_or(REGEXP_CONTAINS(path, '^static/')) as has_static, logical_or(path = 'config.toml') as has_config, count(case when ends_with(path, '.md') then 1 end) as markdown_files FROM `bigquery-public-data.github_repos.sample_files` group by 1 ) where has_theme and has_content and has_static and has_config limit 100
This could then be used for further queries, like finding the most popular themes, the distribution of number of posts, extracting tags from blog posts for a classifier and other things.