-
Notifications
You must be signed in to change notification settings - Fork 9
Open
Labels
questionFurther information is requestedFurther information is requested
Description
The request from @suzialeksander is to make a list of all "external" linked pages documentation pages referenced on the site. As a simplifying assumption, if it's not "internal", it will need a full URL.
The command filter I tried is:
- look in all files in geneontology.github.io
- look for anything that looks like a URL
- remove any trailing punctuation
- deduplicate it
- look for things that contain "geneontology" (as we're pointing to "ourselves")
- filter out:
- wiki
- github
- current/snapshot/release
- model
- amigo
- api
- GO_REFs
- external2go
- /docs/
- /page/
grep -r -h -Eo "(http|https)://[a-zA-Z0-9./?=_%:-]*" * | sed 's:/*$::' | sed 's:\.*$::' | sort -u | uniq | grep geneontology | grep -v wiki | grep -v github.com | grep -v githubuser | grep -v current.gene | grep -v snapshot.gene | grep -v release.gene | grep -v model.gene | grep -v amigo.gene | grep -v api.gene | grep -v GO_REF | grep -v external2go | grep -v '/docs/' | grep -v '/page/'
The results look like:
http://archive.geneontology.org/full
http://geneontology.org
http://geneontology.org/404.html
http://geneontology.org/allposts.html
http://geneontology.org/covid-19.html
http://geneontology.org/faq/what-gene-product
http://geneontology.org/feed.xml
http://geneontology.org/form/contact-go
http://geneontology.org/gene-associations
http://geneontology.org/go-cam
http://geneontology.org/GO.contacts.shtml
http://geneontology.org/gorefs.html
http://geneontology.org/ontology
http://geneontology.org/ontology/gene_ontology.obo
http://geneontology.org/ribbon.html
http://geneontology.org/search.html
http://geneontology.org/sitemap.xml
http://geneontology.org/sparql.html
http://geneontology.org/stats.html
http://golr-aux.geneontology.io/solr/select?fq=document_category:%22annotation%22
http://golr-aux.geneontology.io/solr/select?fq=document_category:%22ontology_class%22
http://help.geneontology.org
http://noctua.geneontology.org
http://noctua.geneontology.org/editor/graph/gomodel:5b91dbd100001993
https://geneontology.cloud
https://geneontology.cloud/browse
https://geneontology.github.io
https://geneontology.org
https://geneontology.org/favicon.ico
https://geneontology.org/go-cam
https://geneontology.org/stats.html
https://geneontology-public.s3.amazonaws.com/assets/ALLIANCE-logo-nobackground_foundingmember.png
https://geneontology-public.s3.amazonaws.com/assets/font-awesome.min.css
https://geneontology-public.s3.amazonaws.com/assets/go-logo.large.png
https://help.geneontology.org
http://sourceforge.net/projects/geneontology
https://rdf.geneontology.org/blazegraph/sparql
https://s3.amazonaws.com/geneontology-public/gocam/GO-CAMs.ttl.zip
http://www.geneontology.org/GO.evidence.shtml
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested