Make a list of all "external" linked documentation pages referenced on the site #578

@kltm

The request from @suzialeksander is to make a list of all "external" linked documentation pages referenced on the site. As a simplifying assumption, any link that is not "internal" has to be written as a full URL, so searching for full URLs should find them.

The filter command I tried does the following:

  • look in all files in geneontology.github.io
  • look for anything that looks like a URL
  • remove any trailing slashes and periods
  • deduplicate
  • keep only URLs that contain "geneontology" (as we're pointing to "ourselves")
  • filter out:
    • wiki
    • github
    • current/snapshot/release
    • model
    • amigo
    • api
    • GO_REFs
    • external2go
    • /docs/
    • /page/
grep -r -h -Eo "(http|https)://[a-zA-Z0-9./?=_%:-]*" * | sed 's:/*$::' | sed 's:\.*$::' | sort -u | grep geneontology | grep -v wiki | grep -v github.com | grep -v githubuser | grep -v current.gene | grep -v snapshot.gene | grep -v release.gene | grep -v model.gene | grep -v amigo.gene | grep -v api.gene | grep -v GO_REF | grep -v external2go | grep -v '/docs/' | grep -v '/page/'
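
For readability, the same steps can be sketched one stage per line. This is only a rough equivalent of the one-liner above, not a replacement for it: the function name `list_go_urls` is made up for illustration, the chain of `grep -v` calls is folded into a single `grep -Ev` alternation, and it searches a directory argument (with `--exclude-dir=.git`) rather than `*`, so it may pick up hidden files that the original glob skips.

```shell
#!/bin/sh
# Hypothetical helper (not part of the original issue).
# Usage: list_go_urls [DIR]   (DIR defaults to the current directory)
list_go_urls() {
  # 1. Pull anything that looks like a URL out of every file under DIR.
  grep -r -h -Eo --exclude-dir=.git 'https?://[a-zA-Z0-9./?=_%:-]*' "${1:-.}" |
    # 2. Strip trailing slashes, then trailing periods.
    sed -e 's:/*$::' -e 's:\.*$::' |
    # 3. Deduplicate.
    sort -u |
    # 4. Keep only URLs that point at "ourselves".
    grep geneontology |
    # 5. Drop the categories from the "filter out" list.
    grep -Ev 'wiki|github.com|githubuser|(current|snapshot|release|model|amigo|api).gene|GO_REF|external2go|/docs/|/page/'
}
```

Run from a checkout of geneontology.github.io, `list_go_urls .` prints one URL per line. Adding or removing an exclusion is then a matter of editing the alternation in step 5.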

The results look like:

http://archive.geneontology.org/full
http://geneontology.org
http://geneontology.org/404.html
http://geneontology.org/allposts.html
http://geneontology.org/covid-19.html
http://geneontology.org/faq/what-gene-product
http://geneontology.org/feed.xml
http://geneontology.org/form/contact-go
http://geneontology.org/gene-associations
http://geneontology.org/go-cam
http://geneontology.org/GO.contacts.shtml
http://geneontology.org/gorefs.html
http://geneontology.org/ontology
http://geneontology.org/ontology/gene_ontology.obo
http://geneontology.org/ribbon.html
http://geneontology.org/search.html
http://geneontology.org/sitemap.xml
http://geneontology.org/sparql.html
http://geneontology.org/stats.html
http://golr-aux.geneontology.io/solr/select?fq=document_category:%22annotation%22
http://golr-aux.geneontology.io/solr/select?fq=document_category:%22ontology_class%22
http://help.geneontology.org
http://noctua.geneontology.org
http://noctua.geneontology.org/editor/graph/gomodel:5b91dbd100001993
https://geneontology.cloud
https://geneontology.cloud/browse
https://geneontology.github.io
https://geneontology.org
https://geneontology.org/favicon.ico
https://geneontology.org/go-cam
https://geneontology.org/stats.html
https://geneontology-public.s3.amazonaws.com/assets/ALLIANCE-logo-nobackground_foundingmember.png
https://geneontology-public.s3.amazonaws.com/assets/font-awesome.min.css
https://geneontology-public.s3.amazonaws.com/assets/go-logo.large.png
https://help.geneontology.org
http://sourceforge.net/projects/geneontology
https://rdf.geneontology.org/blazegraph/sparql
https://s3.amazonaws.com/geneontology-public/gocam/GO-CAMs.ttl.zip
http://www.geneontology.org/GO.evidence.shtml
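
Several entries above appear under both `http` and `https` (for example `geneontology.org/go-cam` and `geneontology.org/stats.html`). If one line per page is preferred, a possible post-processing step is sketched below. The name `collapse_schemes` is hypothetical, and the choice to prefer the `https` form is an assumption; it compares only the scheme, so `www.geneontology.org` and `geneontology.org` still count as different hosts.

```shell
#!/bin/sh
# Hypothetical helper: reads one URL per line on stdin and prints one URL
# per scheme-less key, preferring the https form when both schemes occur.
collapse_schemes() {
  awk '{
    key = $0
    sub(/^https?:\/\//, "", key)          # key = URL without its scheme
    if (!(key in best) || $0 ~ /^https:/)  # first sighting, or https wins
      best[key] = $0
  }
  END { for (k in best) print best[k] }' | LC_ALL=C sort
}
```

It can be appended to the end of the filter pipeline, for example `... | collapse_schemes`.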

Metadata

Labels

question: Further information is requested
