{"id":151,"date":"2017-10-17T05:01:16","date_gmt":"2017-10-16T22:01:16","guid":{"rendered":"http:\/\/dataexponent.com\/?p=151"},"modified":"2018-06-26T10:30:58","modified_gmt":"2018-06-26T03:30:58","slug":"quick-analysis-of-selected-project-reports-to-determine-project-reporting-quality","status":"publish","type":"post","link":"https:\/\/dataexponent.com\/?p=151","title":{"rendered":"Quick Analysis of Selected Project Reports to Determine Project Reporting \u201cQuality\u201d"},"content":{"rendered":"<p>Working as a Project Manager for multiple for-profit and not-for-profit organizations, I have seen a lot, I mean a lot, of project reports. Some of the places I&#8217;ve had the opportunity to work for are really into project reporting, some not so much.<\/p>\n<p>While learning Big Data tools, I&#8217;ve often come up with questions of my own. One was: can I get a measure of project report &#8220;quality&#8221; without actually reading the report? Can I look at all the reports in the organization and find good project report writers? Can I find projects that consistently produce good project reports? Bad project reports? How do my own project reports stack up to others? How does the &#8220;quality&#8221; of the project reports change over time? Does project report &#8220;quality&#8221; have any relationship to project &#8220;quality&#8221;?<!--more--><\/p>\n<p>Thinking it over, I decided there were two measures that seemed important to me as I read and reviewed project reports: did they say something new, and how much effort went into the report?<\/p>\n<p style=\"padding-left: 60px;\"><em><strong>Uniqueness<\/strong><\/em> \u2013 computed from the number of unique words in each of a project\u2019s narrative reports (perhaps just one type of many reports that might be required) compared to all the words used in the project\u2019s narrative reports to date. 
(As a technical note, this uniqueness number decreases over time because it becomes more and more difficult to write unique content in project reports. To straighten out this effect, the log of the number of unique words is used in comparisons between projects.)<\/p>\n<p style=\"padding-left: 60px;\"><em><strong>Effort<\/strong><\/em> \u2013 calculated as the number of photos and other documents submitted with the narrative report.<\/p>\n<p>I assume that \u201c<em><strong>Uniqueness<\/strong><\/em>\u201d and \u201c<em><strong>Effort<\/strong><\/em>\u201d are valid proxy measures for good project report writing, which is a valid proxy measure for good project monitoring, which is a valid proxy measure for good project implementation&#8230;<\/p>\n<p>I did all this by calculating the size of the initial Bag of Words (BoW) for the very first project narrative report and then comparing that to the size of the BoWs for subsequent project narrative reports over the life of the project.<\/p>\n<p>I couldn&#8217;t convince myself that the size of the initial BoW was significant. It seems that the Fields Effect is fully operational with initial project narrative reports:<\/p>\n<p><a href=\"https:\/\/en.wikiquote.org\/wiki\/W._C._Fields\" target=\"_blank\" rel=\"noopener noreferrer\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" src=\"https:\/\/upload.wikimedia.org\/wikipedia\/commons\/thumb\/0\/04\/Wcfields36682u.jpg\/220px-Wcfields36682u.jpg\" alt=\"\" width=\"132\" height=\"190\" \/><\/a>\u201cIf you can&#8217;t dazzle them with brilliance, baffle them with bullshit.\u201d \u2015 W.C. Fields<\/p>\n<p><a href=\"https:\/\/en.wikiquote.org\/wiki\/W._C._Fields\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/en.wikiquote.org\/wiki\/W._C._Fields<\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>However, the relative BoW size of the subsequent reports was telling. 
My very unscientific selection of reports to test showed that I could classify project reports over the project life into below average, average, and above average.<\/p>\n<p>There appears to be a linear relationship between time and the log of the size of the project narrative report&#8217;s BoW. The slope of this Uniqueness line represents a sustained effort to include new information in project narrative reports. It turns out that Effort was highly correlated with Uniqueness, which shouldn&#8217;t have been a surprise.<\/p>\n<figure id=\"attachment_159\" aria-describedby=\"caption-attachment-159\" style=\"width: 300px\" class=\"wp-caption alignleft\"><a href=\"https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life.jpg\" target=\"_blank\" rel=\"noopener noreferrer\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-159 size-medium\" src=\"https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life-300x227.jpg\" alt=\"\" width=\"300\" height=\"227\" srcset=\"https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life-300x227.jpg 300w, https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life-768x581.jpg 768w, https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life-1024x775.jpg 1024w, https:\/\/dataexponent.com\/wp-content\/uploads\/2018\/06\/Textual-Uniqueness-of-PURLS-Reports-Over-Project-Life-1200x908.jpg 1200w\" sizes=\"auto, (max-width: 300px) 85vw, 300px\" \/><\/a><figcaption id=\"caption-attachment-159\" class=\"wp-caption-text\">Uniqueness Over Project Life<\/figcaption><\/figure>\n<p>My graph shows trend lines overlaying the raw data for a bunch of project reports over the life of the projects. 
Clearly, there is a below-average group, an average group, and an above-average group.<\/p>\n<p>Perhaps an automated project report quality checker can be developed&#8230;<\/p>\n<hr \/>\n<p>Bag-of-words model. (2017, September 8). In Wikipedia, The Free Encyclopedia. Retrieved 23:20, October 12, 2017, from <a href=\"https:\/\/en.wikipedia.org\/w\/index.php?title=Bag-of-words_model&amp;oldid=799590423\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/en.wikipedia.org\/w\/index.php?title=Bag-of-words_model&amp;oldid=799590423<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Working as a Project Manager for multiple for-profit and not-for-profit organizations, I have seen a lot, I mean a lot, of project reports. Some of the places I&#8217;ve had the opportunity to work for are really into project reporting, some not so much. Learning\u00a0Big Data tools I&#8217;ve often had questions of my own that &hellip; <a href=\"https:\/\/dataexponent.com\/?p=151\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Quick Analysis of Selected Project Reports to Determine Project Reporting 
\u201cQuality\u201d&#8221;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":159,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[24,10,25,42,41,29,26,27,28,9,23,33,49,30],"class_list":["post-151","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-bag-of-words","tag-big-data","tag-bow","tag-consulting","tag-custom","tag-fields","tag-project-report","tag-project-report-quality","tag-quality-uniqueness","tag-r","tag-r-language","tag-shiny","tag-software-consulting","tag-wc-fields"],"_links":{"self":[{"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/posts\/151","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dataexponent.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=151"}],"version-history":[{"count":10,"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/posts\/151\/revisions"}],"predecessor-version":[{"id":199,"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/posts\/151\/revisions\/199"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dataexponent.com\/index.php?rest_route=\/wp\/v2\/media\/159"}],"wp:attachment":[{"href":"https:\/\/dataexponent.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=151"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dataexponent.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=151"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dataexponent.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=151"}],"curies":[{"name":"wp","href":"https:\/\/
api.w.org\/{rel}","templated":true}]}}