You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, there is a small subset of articles that are retaining leading formatting boxes in the text. The main parser for the Wikipedia dataset filters most of this out but for about 7K articles it doesn't. This is causing the logic that builds the abstract text to pull the wrong text.