:mod:`filecmp` --- File and Directory Comparisons
=================================================

.. module:: filecmp
   :synopsis: Compare files efficiently.
.. sectionauthor:: Moshe Zadka <moshez@zadka.site.co.il>

**Source code:** :source:`Lib/filecmp.py`

--------------

The :mod:`filecmp` module defines functions to compare files and directories,
with various optional time/correctness trade-offs. For comparing files,
see also the :mod:`difflib` module.

The :mod:`filecmp` module defines the following functions:


.. function:: cmp(f1, f2, shallow=True)

   Compare the files named *f1* and *f2*, returning ``True`` if they seem equal,
   ``False`` otherwise.

   If *shallow* is true (the default), files with identical :func:`os.stat`
   signatures (file type, size, and modification time) are taken to be equal
   without reading their contents.  Otherwise, the files are treated as
   different if their sizes or contents differ.

   Files that were compared using this function will not be compared again unless
   their :func:`os.stat` signature changes.

   Note that no external programs are called from this function, giving it
   portability and efficiency.
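
   As a minimal sketch of how the two modes can disagree, the following writes
   two files of equal size but different contents, forces their modification
   times to match, and compares them both ways.  The file names and the
   ``shallow_vs_deep`` helper are illustrative assumptions, not part of the
   module::

      import filecmp
      import os
      import tempfile

      def shallow_vs_deep():
          """Return (shallow_result, deep_result) for two same-sized files."""
          with tempfile.TemporaryDirectory() as tmp:
              first = os.path.join(tmp, 'first.txt')
              second = os.path.join(tmp, 'second.txt')
              with open(first, 'w') as f:
                  f.write('spam')
              with open(second, 'w') as f:
                  f.write('eggs')
              # Give both files the same timestamps so that their os.stat()
              # signatures (type, size, modification time) are identical.
              os.utime(first, (1000000000, 1000000000))
              os.utime(second, (1000000000, 1000000000))
              shallow = filecmp.cmp(first, second)
              deep = filecmp.cmp(first, second, shallow=False)
          return shallow, deep

   Under these assumptions the helper should return ``(True, False)``: the
   shallow comparison trusts the matching signatures, while the deep comparison
   reads the files and finds that they differ.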


.. function:: cmpfiles(dir1, dir2, common, shallow=True)

   Compare the files in the two directories *dir1* and *dir2* whose names are
   given by *common*.

   Returns three lists of file names: *match*, *mismatch*,
   *errors*.  *match* contains the list of files that match, *mismatch* contains
   the names of those that don't, and *errors* lists the names of files which
   could not be compared.  Files are listed in *errors* if they don't exist in
   one of the directories, the user lacks permission to read them, or the
   comparison could not be done for some other reason.

   The *shallow* parameter has the same meaning and default value as for
   :func:`filecmp.cmp`.

   For example, ``cmpfiles('a', 'b', ['c', 'd/e'])`` will compare ``a/c`` with
   ``b/c`` and ``a/d/e`` with ``b/d/e``.  ``'c'`` and ``'d/e'`` will each be in
   one of the three returned lists.
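
   The three result lists can be illustrated with a small throwaway tree.  This
   is only a sketch: the directory layout, the file names, and the ``write`` and
   ``compare_sample_dirs`` helpers are made up for the example::

      import filecmp
      import os
      import tempfile

      def write(path, text):
          """Create a small text file (helper for this example only)."""
          with open(path, 'w') as f:
              f.write(text)

      def compare_sample_dirs():
          """Build two sample directories and compare three names in them."""
          with tempfile.TemporaryDirectory() as tmp:
              dir1 = os.path.join(tmp, 'a')
              dir2 = os.path.join(tmp, 'b')
              os.mkdir(dir1)
              os.mkdir(dir2)
              # Same contents on both sides
              write(os.path.join(dir1, 'same.txt'), 'spam')
              write(os.path.join(dir2, 'same.txt'), 'spam')
              # Different contents (and sizes) on each side
              write(os.path.join(dir1, 'changed.txt'), 'spam')
              write(os.path.join(dir2, 'changed.txt'), 'spam and eggs')
              # Present in the first directory only
              write(os.path.join(dir1, 'missing.txt'), 'spam')
              names = ['same.txt', 'changed.txt', 'missing.txt']
              return filecmp.cmpfiles(dir1, dir2, names, shallow=False)

   With this layout, ``match`` should be ``['same.txt']``, ``mismatch`` should
   be ``['changed.txt']``, and ``errors`` should be ``['missing.txt']``, because
   the last file cannot be read from the second directory.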


Example::

   >>> import filecmp
   >>> filecmp.cmp('undoc.rst', 'undoc.rst')
   True
   >>> filecmp.cmp('undoc.rst', 'index.rst')
   False


.. _dircmp-objects:

The :class:`dircmp` class
-------------------------

:class:`dircmp` instances are built using this constructor:


.. class:: dircmp(a, b, ignore=None, hide=None)

   Construct a new directory comparison object, to compare the directories *a* and
   *b*. *ignore* is a list of names to ignore, and defaults to ``['RCS', 'CVS',
   'tags']``. *hide* is a list of names to hide, and defaults to ``[os.curdir,
   os.pardir]``.

   The :class:`dircmp` class provides the following methods:


   .. method:: report()

      Print (to ``sys.stdout``) a comparison between *a* and *b*.


   .. method:: report_partial_closure()

      Print a comparison between *a* and *b* and common immediate
      subdirectories.


   .. method:: report_full_closure()

      Print a comparison between *a* and *b* and common subdirectories
      (recursively).
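
   Because the report methods print rather than return their output, a caller
   that wants the text has to capture ``sys.stdout``.  The sketch below does so
   with :func:`contextlib.redirect_stdout`, which is assumed to be available
   (Python 3.4 or later).  The directory names and the ``capture_report``
   helper are illustrative only::

      import contextlib
      import filecmp
      import io
      import os
      import tempfile

      def capture_report():
          """Return the text printed by dircmp.report() for a sample tree."""
          with tempfile.TemporaryDirectory() as tmp:
              left = os.path.join(tmp, 'left')
              right = os.path.join(tmp, 'right')
              os.mkdir(left)
              os.mkdir(right)
              # One file that exists only on the left-hand side
              with open(os.path.join(left, 'only_left.txt'), 'w') as f:
                  f.write('spam')
              buffer = io.StringIO()
              # Redirect sys.stdout for the duration of the report() call
              with contextlib.redirect_stdout(buffer):
                  filecmp.dircmp(left, right).report()
          return buffer.getvalue()

   The captured text names the two directories and lists ``only_left.txt``
   among the entries found only in the first one.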

   The :class:`dircmp` class offers a number of interesting attributes that may
   be used to get various bits of information about the directory trees being
   compared.

   Note that via :meth:`__getattr__` hooks, all attributes are computed lazily,
   so there is no speed penalty if only those attributes which are lightweight
   to compute are used.


   .. attribute:: left_list

      Files and subdirectories in *a*, filtered by *hide* and *ignore*.


   .. attribute:: right_list

      Files and subdirectories in *b*, filtered by *hide* and *ignore*.


   .. attribute:: common

      Files and subdirectories in both *a* and *b*.


   .. attribute:: left_only

      Files and subdirectories only in *a*.


   .. attribute:: right_only

      Files and subdirectories only in *b*.


   .. attribute:: common_dirs

      Subdirectories in both *a* and *b*.


   .. attribute:: common_files

      Files in both *a* and *b*.


   .. attribute:: common_funny

      Names in both *a* and *b*, such that the type differs between the
      directories, or names for which :func:`os.stat` reports an error.


   .. attribute:: same_files

      Files which are identical in both *a* and *b*.


   .. attribute:: diff_files

      Files which are in both *a* and *b*, whose contents differ.


   .. attribute:: funny_files

      Files which are in both *a* and *b*, but could not be compared.


   .. attribute:: subdirs

      A dictionary mapping names in :attr:`common_dirs` to :class:`dircmp`
      objects.
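
   One way to use :attr:`subdirs` together with :attr:`diff_files` is to walk
   two trees recursively and collect every differing file.  The following is a
   sketch rather than a recommended recipe; the ``collect_diff_files`` helper
   and its *prefix* parameter are hypothetical names introduced here::

      import filecmp
      import os

      def collect_diff_files(dcmp, prefix=''):
          """Return relative paths of files that differ, at any depth."""
          # Differing files directly inside the two directories being compared
          found = [os.path.join(prefix, name) for name in dcmp.diff_files]
          # Recurse into each common subdirectory through its dircmp object
          for name, sub_dcmp in sorted(dcmp.subdirs.items()):
              found.extend(collect_diff_files(sub_dcmp, os.path.join(prefix, name)))
          return found

   It could be called as ``collect_diff_files(filecmp.dircmp('dir1', 'dir2'))``,
   where ``dir1`` and ``dir2`` stand for any two existing directories.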