Skip to content

citeproc output for gfm includes raw html (even w/ -raw_html) #7965

@jjallaire

Description

@jjallaire

Consider this document gfmcite.md:

@WatsonCrick1953

---
references:
- type: article-journal
  id: WatsonCrick1953
  author:
  - family: Watson
    given: J. D.
  - family: Crick
    given: F. H. C.
  issued:
    date-parts:
    - - 1953
      - 4
      - 25
  title: 'Molecular structure of nucleic acids: a structure for
    deoxyribose nucleic acid'
  title-short: Molecular structure of nucleic acids
  container-title: Nature
  volume: 171
  issue: 4356
  page: 737-738
  DOI: 10.1038/171737a0
  URL: https://www.nature.com/articles/171737a0
  language: en-GB
---

Converting to GFM includes raw html tags surrounding the bibliography:

pandoc gfmcite.md --to gfm --citeproc
Watson and Crick (1953)

<div id="refs" class="references csl-bib-body hanging-indent">

<div id="ref-WatsonCrick1953" class="csl-entry">

Watson, J. D., and F. H. C. Crick. 1953. “Molecular Structure of Nucleic
Acids: A Structure for Deoxyribose Nucleic Acid.” *Nature* 171 (4356):
737–38. <https://doi.org/10.1038/171737a0>.

</div>

</div>

Perhaps by design? (if so I'm not sure whether any GFM environment would actually do anything w/ those tags). If we attempt to remove raw_html we get the same output:

pandoc gfmcite.md --to gfm-raw_html --citeproc

However if we do the same w/ commonmark the HTML is excluded as expected:

pandoc gfmcite.md --to commonmark-raw_html --citeproc
Watson and Crick (1953)

Watson, J. D., and F. H. C. Crick. 1953. “Molecular Structure of Nucleic
Acids: A Structure for Deoxyribose Nucleic Acid.” *Nature* 171 (4356):
737–38. <https://doi.org/10.1038/171737a0>.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions