ive just learned something so fucking funny about bad data sets. there's a presentation im sitting in where an AI is trained on covid-19 research papers and asked to transform a conclusion section, and always comes out with "the covid-19 vaccine requires more research to prove its safety", not because the data is harmful per-se, but because researchers who have a vested interest in maintaining grant funding regularly include in their conclusions that more research must be done, even positively