The following historical commit information, by author, was found.

Author Commits Insertions Deletions % of changes
Emily Tew339601.20
Ian Grimstead696570.47
IanGrimstead1417734388235.34
Thanasis1319866652949.88
emily-tew1122730.59
emilytew91611520.95
jonesg222180.09
mshodge125802512.53
user62408610187410658.94
 

Below are the number of rows from each author that have survived and are still intact in the current revision.

Author Rows Stability Age % in comments
IanGrimstead339943.94.43.0042.73
Michael Hodge22100.04.30.000.28
Thanasis441244.72.910.3455.46
emily-tew4436.11.02.270.55
emilytew4729.22.70.000.59
jonesg21676.27.312.500.20
user624086150.83.10.000.19
 

The following history timeline has been gathered from the repository.

Author2018W192018W202018W212018W222018W232018W242018W252018W26
Emily Tew
 
 
Ian Grimstead...
IanGrimstead
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Thanasis
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
emilytew.
jonesg2.
mshodge.
 
 
 
 
.
user624086
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
Modified Rows:141856620186105583683319392391
Author2018W272018W28
IanGrimstead
 
 
 
 
 
 
 
 
 
Thanasis
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
emily-tew.
emilytew.
Modified Rows:86702839

The following files are suspiciously big (in order of severity).

app_detect/tests/algorithms/test_tfidf_vv.py (1514 estimated lines of code)
app_detect/scripts/data_access/vvcode/abstracts2pickle.py (882 estimated lines of code)

The following files have an elevated cyclomatic complexity (in order of severity)

app_detect/fdg/knockout-3.4.2.js (170 in cyclomatic complexity)
app_detect/submodules/enhanced_subject_verb_object_extraction/subject_verb_object_extract.py (148 in cyclomatic complexity)
app_detect/scripts/algorithms/tf_idf.py (125 in cyclomatic complexity)
app_detect/scripts/data_access/USPatentXML2DF.py (116 in cyclomatic complexity)
app_detect/scripts/data_access/patent_data_factory.py (75 in cyclomatic complexity)
app_detect/scripts/data_access/Patents2DataFrame.py (67 in cyclomatic complexity)
app_detect/tests/data_access/test_Patents2DataFrame.py (62 in cyclomatic complexity)
app_pat2gc/scripts/SciKitClassificationAlgs.py (54 in cyclomatic complexity)
app_detect/scripts/data_access/UKPatentXML2DF.py (52 in cyclomatic complexity)

The following files have an elevated cyclomatic complexity density (in order of severity)

app_detect/fdg/knockout-3.4.2.js (1.429 in cyclomatic complexity density)
app_pat2gc/scripts/SciKitClassificationAlgs.py (1.059 in cyclomatic complexity density)

The following responsibilities, by author, were found in the current revision of the repository (comments are excluded from the line count, if possible).

IanGrimstead is mostly responsible for

app_detect/tests/algorithms/test_tfidf_vv.py (867 eloc)
app_detect/scripts/data_access/vvcode/abstracts2pickle.py (433 eloc)
app_detect/tests/data_access/test_USPatent2DF.py (218 eloc)
app_detect/scripts/data_access/USPatentXML2DF.py (186 eloc)
app_bridge/IPOPickles2DataFrame.py (183 eloc)
app_detect/scripts/data_access/Patents2DataFrame.py (164 eloc)
app_detect/scripts/algorithms/tf_idf.py (152 eloc)
app_detect/tests/data_access/test_UKPatent2DF.py (128 eloc)
app_detect/scripts/data_access/UKPatentXML2DF.py (113 eloc)
app_detect/tests/data_access/test_Patents2DataFrame.py (110 eloc)

Michael Hodge is mostly responsible for

app_detect/tests/algorithms/test_tfidf_vv.py (22 eloc)

Thanasis is mostly responsible for

app_detect/tests/algorithms/test_tfidf_vv.py (600 eloc)
app_detect/scripts/data_access/vvcode/abstracts2pickle.py (449 eloc)
app_pat2gc/scripts/textmodelproc2.py (281 eloc)
app_pat2gc/scripts/textmodelproc.py (264 eloc)
app_detect/submodules/enhanced_subject_verb_object_extraction/subject_verb_object_extract.py (254 eloc)
app_detect/fdg/f.js (186 eloc)
app_detect/submodules/enhanced_subject_verb_object_extraction/subject_verb_object_extract_test.py (148 eloc)
app_detect/detect.py (143 eloc)
app_detect/scripts/data_access/USPatentXML2DF.py (138 eloc)
app_detect/scripts/algorithms/tf_idf.py (133 eloc)

emily-tew is mostly responsible for

app_detect/scripts/algorithms/tf_idf.py (29 eloc)
app_detect/tests/data_access/test_Patents2DataFrame.py (10 eloc)
app_detect/detect.py (2 eloc)
app_detect/tests/data_access/test_XMLPatents2DF.py (1 eloc)
app_detect/tests/algorithms/test_TFIDF_sklearn.py (1 eloc)

emilytew is mostly responsible for

app_detect/tests/algorithms/test_tfidf_vv.py (25 eloc)
app_detect/scripts/data_access/patent_data_factory.py (21 eloc)
app_detect/detect.py (1 eloc)

jonesg2 is mostly responsible for

app_detect/tests/utils/test_Text2Geo.py (7 eloc)
app_detect/scripts/utils/Text2Geo.py (7 eloc)

user624086 is mostly responsible for

app_detect/detect.py (15 eloc)

The extensions below were found in the repository history (extensions used during statistical analysis are marked).

* XML bat css dtd html ipynb js json md pdf py txt xml yml