{"id":758,"date":"2024-12-23T07:02:41","date_gmt":"2024-12-23T07:02:41","guid":{"rendered":"https:\/\/mailitics.com\/index.php\/2024\/12\/23\/statisticians_scripts_and_chaos_my_journey_back\/"},"modified":"2024-12-23T07:02:41","modified_gmt":"2024-12-23T07:02:41","slug":"statisticians_scripts_and_chaos_my_journey_back","status":"publish","type":"post","link":"https:\/\/mailitics.com\/index.php\/2024\/12\/23\/statisticians_scripts_and_chaos_my_journey_back\/","title":{"rendered":"Statisticians, Scripts, and Chaos: My Journey Back to the 90s"},"content":{"rendered":"<p>    Statisticians, Scripts, and Chaos: My Journey Back to the 90s<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>We often hear a lot about how data science teams can lack statistical expertise and how this can lead to flawed analyses or misinterpretation of results. It\u2019s a valid concern, and the dangers are real. But let me tell you, there\u2019s another side of the coin that had me saying, \u201cHoly bleep.\u201d<\/p>\n<p>This year, I joined a project where the team is dominated by statisticians and economists. Sounds like a data science dream team, right? Not so fast. It feels like I hopped into a time machine and landed in the 90s. Git? Never heard of it. Instead, we\u2019ve got the old-school hierarchy of script_v1, script_final_version_1, script_final_version_2, all the way to script_final_version_n. It&#8217;s a wild ride.<\/p>\n<p>Code reviews? Absolutely nonexistent. Every script is its own handcrafted masterpiece, riddled with what I can only describe as &#8220;surprise features&#8221; in the preprocessing pipeline. Bugs aren\u2019t bugs, apparently. \u201cIf you just pay close attention and read your code twice, you\u2019ll see there\u2019s no issue,\u201d they tell me. Uh, sure. I don\u2019t trust a single output right now because I know that behind every analysis bugs are having the party of their lives. <\/p>\n<p>Chances are, statisticians have absolutely no idea how a modern database actually works, have never heard of a non-basic data structure like a HyperLogLog, and have likely never wrestled with a truly messy real-world dataset.<\/p>\n<\/p><\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/Raz4r\"> \/u\/Raz4r <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datascience\/comments\/1hjluem\/statisticians_scripts_and_chaos_my_journey_back\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datascience\/comments\/1hjluem\/statisticians_scripts_and_chaos_my_journey_back\/\">[comments]<\/a><\/span>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    \/u\/Raz4r<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/www.reddit.com\/r\/datascience\/comments\/1hjluem\/statisticians_scripts_and_chaos_my_journey_back\/\">Go to original source<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Statisticians, Scripts, and Chaos: My Journey Back to the 90s We often hear a lot about how data science teams can lack statistical expertise and how this can lead to flawed analyses or misinterpretation of results. It\u2019s a valid concern, and the dangers are real. But let me tell you, there\u2019s another side of the [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[62,99],"tags":[868,866,867],"class_list":["post-758","post","type-post","status-publish","format-standard","hentry","category-aimldsaimlds","category-datascience","tag-s","tag-script","tag-statisticians"],"_links":{"self":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/758"}],"collection":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/comments?post=758"}],"version-history":[{"count":0,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/758\/revisions"}],"wp:attachment":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/media?parent=758"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/categories?post=758"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/tags?post=758"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}