In a typical git-diff patch (see below), there are a few major parts.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
diff --git a/git_fix_whitespace.py b/git_fix_whitespace.py | |
index 39f6807..0c72f5f 100755 | |
--- a/git_fix_whitespace.py | |
+++ b/git_fix_whitespace.py | |
@@ -15,7 +15,7 @@ def sanitize_line(line): | |
return line | |
-def sanitize_diff(git_diff): | |
+def sanitize_diff(git_diff, git_root): | |
'''Sanitize lines in diff | |
Only files add or modify are sanitized. | |
@@ -33,8 +33,7 @@ def sanitize_diff(git_diff): | |
# Cannot find file path of working file | |
return | |
- #FIXME: file_path should be prefixed by repository root | |
- file_path = m.group(1) | |
+ file_path = os.path.join(git_root, m.group(1)) | |
backup_path = '%s.orig' % file_path | |
line_changes = {} # line no. -> sanitized line | |
@@ -106,8 +105,8 @@ def main(): | |
head_diff_add = head_diff.iter_change_type('A') | |
head_diff_modify = head_diff.iter_change_type('M') | |
- map(sanitize_diff, head_diff_add) | |
- map(sanitize_diff, head_diff_modify) | |
+ map(lambda i: sanitize_diff(i, git_root), head_diff_add) | |
+ map(lambda i: sanitize_diff(i, git_root), head_diff_modify) |
- Line 1 provides metadata about the modified file. It gives the file path to the file, rooted at the git repository.
- Line 2 provides metadata about the git index and the file object discretionary access control list.
- Line 3 and 4 provides metadata about which file path is old and which is new.
- Line 5, 14, and 24 are metadata that tells which part of the file is modified. Let's take an example to explain this - "@@ -33,8 +33,7 @@ def sanitize_diff(git_diff):". "-33,8" tells that there is a diff hunk with 8 lines starts at line 33 of the old version of the file. "+33,7" tells that there is a diff hunk with 7 lines starts at line 33 of the new version of the file. As a result, the new version of the file gets one line fewer than the old version.
- The rest of the patch is the content in the modified file. Those lines are either prefixed by ' ', '-', or '+'. ' ' means no modification, '-' means removed line, and '+' means added line. Note, there is no line replacement since it is represented by '-' lines followed by '+' lines (see line 28-31).