author["name"] causing crash #175

@marban

Since upgrading to 3.1.15, I'm getting the following error on extract, even though I don't use any author attributes:

  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/__init__.py", line 125, in extract
    return self.__crawl(crawl_candidate)
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/__init__.py", line 153, in __crawl
    return crawler_wrapper(self.config.parser_class, parsers, crawl_candidate)
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/__init__.py", line 141, in crawler_wrapper
    article = crawler.crawl(crawl_candidate)
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/crawler.py", line 135, in crawl
    return self.process(raw_html, parse_candidate.url, parse_candidate.link_hash)
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/crawler.py", line 183, in process
    self.article._authors = self.authors_extractor.extract()
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/extractors/authors.py", line 27, in extract
    authors_from_schema = self.__get_authors_from_schema()
  File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/goose3/extractors/authors.py", line 72, in __get_authors_from_schema
    authors.append(author["name"])
TypeError: string indices must be integers
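
For context, the TypeError above means that `author` was a plain string at the point where `author["name"]` was evaluated. One plausible cause (an assumption, since the page's markup is not shown here) is that the page's schema.org metadata gives "author" as a string or a list of strings rather than as an object with a "name" key. Below is a minimal sketch of a defensive normalizer that accepts the common shapes. The function name `author_names` is hypothetical and is not part of goose3; this is not the library's actual fix.

```python
from typing import Any, List


def author_names(author_field: Any) -> List[str]:
    """Collect author names from a schema.org-style "author" value.

    Hypothetical helper for illustration only; not part of goose3.
    Accepts a string, a dict with a "name" key, or a list mixing both.
    """
    # A single author (plain string or dict) is wrapped in a list
    # so that one loop handles every supported shape.
    if isinstance(author_field, (str, dict)):
        author_field = [author_field]

    # Anything else that is not a list or tuple (None, numbers, ...) is ignored.
    if not isinstance(author_field, (list, tuple)):
        return []

    names: List[str] = []
    for author in author_field:
        if isinstance(author, str):
            name = author
        elif isinstance(author, dict):
            # .get avoids a KeyError when "name" is missing.
            name = author.get("name")
        else:
            name = None

        # Keep only non-empty string names.
        if isinstance(name, str) and name.strip():
            names.append(name.strip())
    return names
```

With a helper like this, a string-valued author is returned as-is instead of being indexed with `"name"`, and entries without a usable name are skipped rather than raising.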
