To what extent are the results of research investigations influenced by subjective decisions that scientists make as they design studies? Fifteen research teams independently designed studies to answer five original research questions related to moral judgments, negotiations, and implicit cognition. Participants from two separate, large samples (total N > 15,000) were then randomly assigned to complete one version of each study. Effect sizes varied dramatically across different sets of materials designed to test the same hypothesis: materials from different teams rendered significant effects in opposite directions for four out of five hypotheses, with the narrowest range in estimates being d = -0.37 to 0.26. Meta-analysis indicated a lack of overall support for two original hypotheses, mixed support for one hypothesis, and significant support for two hypotheses. Overall, none of the variability in effect sizes was attributable to the skill of the research team in designing materials, while some variability was attributable to the hypothesis being tested. In a forecasting survey, predictions of other scientists were strongly correlated with study results, and average predictions were similar to observed outcomes. Crowdsourced testing of research hypotheses helps reveal the true consistency of empirical support for a scientific claim.

Landy, J. F., (liam) Jia, M., Ding, I., Viganola, D., Tierney, W., Dreber, A., Johannesson, M., Pfeiffer, T., Ebersole, C. R., Gronau, Q. F., Ly, A., Van Den Bergh, D., Marsman, M., Wagenmakers, E., Bartels, D. M., Bauman, C. W., Brady, W., Cheung, F., Cimpian, A., Dohle, S., Brent Donnellan, M., Hahn, A., Hall, M., Jiménez-Leal, W., Johnson, D. J., Lucas, R. E., Monin, B., Montealegre, A., Mullen, E., Pang, J., Ray, J., Reinero, D. A., Reynolds, J., Sowden, W., Storage, D., Su, R., Tworek, C. M., Van Bavel, J. J., Walco, D., Wills, J., Xu, X., Chi Yam, K., Yang, X., Schweinsberg, M., Urwitz, M., Adamkovič, M., Alaei, R., Albers, C. J., Allard, A., Anderson, I. A., Andreychik, M. R., Babinčák, P., Baker, B. J., Baník, G., Baskin, E., Bavolar, J., Berkers, R. M. W. J., Białek, M., Blanke, J., Breuer, J., Brizi, A., Brown, S. E. V., Brühlmann, F., Bruns, H., Caldwell, L., Campourcy, J., Chan, E. Y., Chang, Y., Cheung, B. Y., Chin, A., Cho, K. W., Columbus, S., Conway, P., Corretti, C. A., Craig, A. W., Curran, P. G., Danvers, A. F., Dawson, I. G. J., Day, M. V., Dietl, E., Doerflinger, J. T., Dominici, A., Dranseika, V., Edelsbrunner, P. A., Edlund, J. E., Fisher, M., Fung, A., Genschow, O., Gnambs, T., Goldberg, M. H., Graf-Vlachy, L., Hafenbrack, A. C., Hafenbrädl, S., Hartanto, A., Heck, P. R., Heffner, J. P., Hilgard, J., Holzmeister, F., Horchak, O. V., Huang, T. S. -., Hüffmeier, J., Hughes, S., Hussey, I., Imhoff, R., Jaeger, B., Jamro, K., Johnson, S. G. B., Jones, A., Keller, L., Kombeiz, O., Krueger, L. E., Lantian, A., Laplante, J. P., Lazarevic, L. B., Leclerc, J., Legate, N., Leonhardt, J. M., Leung, D. W., Levitan, C. A., Lin, H., Liu, Q., Tullio Liuzza, M., Locke, K. D., Ly, A. L., Maceacheron, M. D., Madan, C. R., Manley, H., Mari, S., Martončik, M., Mclean, S. L., Mcphetres, J., Mercier, B. G., Michels, C., Mullarkey, M. C., Musser, E. D., Nalborczyk, L., Nilsonne, G., Otis, N. G., Otner, S. M. G., Otto, P. E., Oviedo-Trespalacios, O., Paruzel- Czachura, M., Pellegrini, F., Pereira, V. M. D., Perfecto, H., Pfuhl, G., Phillips, M. H., Plonsky, O., Pozzi, M., Purić, D. B., Raymond-Barker, B., Redman, D. E., Reynolds, C. J., Ropovik, I., Röseler, L., Ruessmann, J. K., Ryan, W. H., Sablaturova, N., Schuepfer, K. J., Schütz, A., Sirota, M., Stefan, M., Stocks, E. L., Strosser, G. L., Suchow, J. W., Szabelska, A., Tey, K. S., Tiokhin, L., Troian, J., Utesch, T., Vásquez-Echeverría, A., Ann Vaughn, L., Verschoor, M., Von Helversen, B., Wallisch, P., Weissgerber, S. C., Wichman, A. L., Woike, J. K., Žeželj, I., Zickfeld, J. H., Ahn, Y., Blaettchen, P. F., Kang, X., Jin Lee, Y., Parker, P. M., Parker, P. A., Song, J. S., Very, M., Wong, L., Uhlmann, E. L., Crowdsourcing hypothesis tests: Making transparent how design choices shape research results, <<PSYCHOLOGICAL BULLETIN>>, 5; 146 (5): 451-479. [doi:10.1037/bul0000220] [http://hdl.handle.net/10807/146268]

Crowdsourcing hypothesis tests: Making transparent how design choices shape research results

Pozzi, Maura;
2020

Abstract

To what extent are the results of research investigations influenced by subjective decisions that scientists make as they design studies? Fifteen research teams independently designed studies to answer five original research questions related to moral judgments, negotiations, and implicit cognition. Participants from two separate, large samples (total N > 15,000) were then randomly assigned to complete one version of each study. Effect sizes varied dramatically across different sets of materials designed to test the same hypothesis: materials from different teams rendered significant effects in opposite directions for four out of five hypotheses, with the narrowest range in estimates being d = -0.37 to 0.26. Meta-analysis indicated a lack of overall support for two original hypotheses, mixed support for one hypothesis, and significant support for two hypotheses. Overall, none of the variability in effect sizes was attributable to the skill of the research team in designing materials, while some variability was attributable to the hypothesis being tested. In a forecasting survey, predictions of other scientists were strongly correlated with study results, and average predictions were similar to observed outcomes. Crowdsourced testing of research hypotheses helps reveal the true consistency of empirical support for a scientific claim.
2020
Inglese
Landy, J. F., (liam) Jia, M., Ding, I., Viganola, D., Tierney, W., Dreber, A., Johannesson, M., Pfeiffer, T., Ebersole, C. R., Gronau, Q. F., Ly, A., Van Den Bergh, D., Marsman, M., Wagenmakers, E., Bartels, D. M., Bauman, C. W., Brady, W., Cheung, F., Cimpian, A., Dohle, S., Brent Donnellan, M., Hahn, A., Hall, M., Jiménez-Leal, W., Johnson, D. J., Lucas, R. E., Monin, B., Montealegre, A., Mullen, E., Pang, J., Ray, J., Reinero, D. A., Reynolds, J., Sowden, W., Storage, D., Su, R., Tworek, C. M., Van Bavel, J. J., Walco, D., Wills, J., Xu, X., Chi Yam, K., Yang, X., Schweinsberg, M., Urwitz, M., Adamkovič, M., Alaei, R., Albers, C. J., Allard, A., Anderson, I. A., Andreychik, M. R., Babinčák, P., Baker, B. J., Baník, G., Baskin, E., Bavolar, J., Berkers, R. M. W. J., Białek, M., Blanke, J., Breuer, J., Brizi, A., Brown, S. E. V., Brühlmann, F., Bruns, H., Caldwell, L., Campourcy, J., Chan, E. Y., Chang, Y., Cheung, B. Y., Chin, A., Cho, K. W., Columbus, S., Conway, P., Corretti, C. A., Craig, A. W., Curran, P. G., Danvers, A. F., Dawson, I. G. J., Day, M. V., Dietl, E., Doerflinger, J. T., Dominici, A., Dranseika, V., Edelsbrunner, P. A., Edlund, J. E., Fisher, M., Fung, A., Genschow, O., Gnambs, T., Goldberg, M. H., Graf-Vlachy, L., Hafenbrack, A. C., Hafenbrädl, S., Hartanto, A., Heck, P. R., Heffner, J. P., Hilgard, J., Holzmeister, F., Horchak, O. V., Huang, T. S. -., Hüffmeier, J., Hughes, S., Hussey, I., Imhoff, R., Jaeger, B., Jamro, K., Johnson, S. G. B., Jones, A., Keller, L., Kombeiz, O., Krueger, L. E., Lantian, A., Laplante, J. P., Lazarevic, L. B., Leclerc, J., Legate, N., Leonhardt, J. M., Leung, D. W., Levitan, C. A., Lin, H., Liu, Q., Tullio Liuzza, M., Locke, K. D., Ly, A. L., Maceacheron, M. D., Madan, C. R., Manley, H., Mari, S., Martončik, M., Mclean, S. L., Mcphetres, J., Mercier, B. G., Michels, C., Mullarkey, M. C., Musser, E. D., Nalborczyk, L., Nilsonne, G., Otis, N. G., Otner, S. M. G., Otto, P. E., Oviedo-Trespalacios, O., Paruzel- Czachura, M., Pellegrini, F., Pereira, V. M. D., Perfecto, H., Pfuhl, G., Phillips, M. H., Plonsky, O., Pozzi, M., Purić, D. B., Raymond-Barker, B., Redman, D. E., Reynolds, C. J., Ropovik, I., Röseler, L., Ruessmann, J. K., Ryan, W. H., Sablaturova, N., Schuepfer, K. J., Schütz, A., Sirota, M., Stefan, M., Stocks, E. L., Strosser, G. L., Suchow, J. W., Szabelska, A., Tey, K. S., Tiokhin, L., Troian, J., Utesch, T., Vásquez-Echeverría, A., Ann Vaughn, L., Verschoor, M., Von Helversen, B., Wallisch, P., Weissgerber, S. C., Wichman, A. L., Woike, J. K., Žeželj, I., Zickfeld, J. H., Ahn, Y., Blaettchen, P. F., Kang, X., Jin Lee, Y., Parker, P. M., Parker, P. A., Song, J. S., Very, M., Wong, L., Uhlmann, E. L., Crowdsourcing hypothesis tests: Making transparent how design choices shape research results, <<PSYCHOLOGICAL BULLETIN>>, 5; 146 (5): 451-479. [doi:10.1037/bul0000220] [http://hdl.handle.net/10807/146268]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/146268
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 88
  • ???jsp.display-item.citation.isi??? 77
social impact