uscan(1)



NAME

   uscan - scan/watch upstream sources for new releases of software

SYNOPSIS

   uscan [options] [path]

DESCRIPTION

   For basic usage, uscan is executed without any arguments from the root
   of the Debianized source tree where you see the debian/ directory.
   Then typically the following happens:

   *   uscan reads the first entry in debian/changelog to determine the
       source package name <spkg> and the last upstream version.

   *   uscan process the watch lines debian/watch from the top to the
       bottom in a single pass.

       *   uscan downloads a web page from the specified URL in
           debian/watch.

       *   uscan extracts hrefs pointing to the upstream tarball(s) from
           the web page using the specified matching-pattern in
           debian/watch.

       *   uscan downloads the upstream tarball with the highest version
           newer than the last upstream version.

       *   uscan saves the downloaded tarball to the parent ../ directory:
           ../<upkg>-<uversion>.tar.gz

       *   uscan invokes mk-origtargz to create the source tarball:
           ../<spkg>_<oversion>.orig.tar.gz

           *   For a multiple upstream tarball (MUT) package, the
               secondary upstream tarball will instead be named
               ../<spkg>_<oversion>.orig-<component>.tar.gz.

       *   Repeat until all lines in debian/watch are processed.

   *   uscan invokes uupdate to create the Debianized source tree:
       ../<spkg>-<oversion>/*

   Please note the following.

   *   For simplicity, the compression method used in examples is gzip
       with .gz suffix.  Other methods such as xz, bzip2, and lzma with
       corresponding xz, bz2, and lzma suffixes may also be used.

   *   The new version=4 enables handling of multiple upstream tarball
       (MUT) packages but this is a rare case for Debian packaging.  For a
       single upstream tarball package, there is only one watch line and
       no ../<spkg>_<oversion>.orig-<component>.tar.gz .

   *   uscan with the --verbose option produces a human readable report of
       uscan's execution.

   *   uscan with the --debug option produces a human readable report of
       uscan's execution including internal variable states.

   *   uscan with the --dehs option produces an upstream package status
       report in XML format for other programs such as the Debian External
       Health System.

   *   The primary objective of uscan is to help identify if the latest
       version upstream tarball is used or not; and to download the latest
       upstream tarball.  The ordering of versions is decided by dpkg
       --compare-versions.

   *   uscan with the --safe option limits the functionality of uscan to
       its primary objective.  Both the repacking of downloaded files and
       updating of the source tree are skipped to avoid running unsafe
       scripts.  This also changes the default to --no-download and
       --skip-signature.

FORMAT OF THE WATCH FILE

   The current version 4 format of debian/watch can be summarized as
   follows:

   *   Leading spaces and tabs are dropped.

   *   Empty lines are dropped.

   *   A line started by # (hash) is a comment line and dropped.

   *   A single \ (back slash) at the end of a line is dropped and the
       next line is concatenated after removing leading spaces and tabs.
       The concatenated line is parsed as a single line. (The existence or
       non-existence of the space before the tailing single \ is
       significant.)

   *   The first non-comment line is:

       version=4

       This is required.

   *   The following non-comment lines (watch lines) specify the rules for
       the selection of the candidate upstream tarball URLs and are in one
       of the following three formats:

       *   opts=" ... " http://URL matching-pattern [version [script]]

       *   http://URL matching-pattern [version [script]]

       *   opts=" ... "

       Here,

       *   opts=" ... " specifies the behavior of uscan.  See "WATCH FILE
           OPTIONS".

       *   http://URL specifies the web page where upstream publishes the
           link to the latest source archive.

           *   https://URL may also be used, as may

           *   ftp://URL

           *   Some parts of URL may be in the regex match pattern
               surrounded between ( and ) such as /foo/bar-([\.\d]+)/.
               (If multiple directories match, the highest version is
               picked.) Otherwise, the URL is taken as verbatim.

       *   matching-pattern specifies the full string matching pattern for
           hrefs in the web page.  See "WATCH FILE EXAMPLES".

           *   All matching parts in ( and ) are concatenated with .
               (period) to form the upstream version.

           *   If the hrefs do not contain directories, you can combine
               this with the previous entry. I.e., http://URL/matching-
               pattern .

       *   version restricts the upstream tarball which may be downloaded.
           The newest available version is chosen in each case.

           *   debian requires the downloading upstream tarball to be
               newer than the version obtained from debian/changelog.

           *   version-number such as 12.5 requires the upstream tarball
               to be newer than the version-number.

           *   same requires the downloaded version of the secondary
               tarballs to be exactly the same as the one for the first
               upstream tarball downloaded. (Useful only for MUT)

           *   previous restricts the version of the signature file. (Used
               with pgpmode=previous)

           *   ignore does not restrict the version of the secondary
               tarballs. (Maybe useful for MUT)

       *   script is executed at the end of uscan execution with
           appropriate arguments provided by uscan.

           *   The typical Debian package is a non-native package made
               from one upstream tarball.  Only a single line of the watch
               line in one of the first two formats is usually used with
               its version set to debian and script set to uupdate.

           *   A native package should not specify script.

           *   A multiple upstream tarball (MUT) package should specify
               uupdate as script in the last watch line and should skip
               specifying script in the rest of the watch lines.

       *   The last format of the watch line is useful to set the
           persistent parameters: user-agent, compression.  If this format
           is used, this must be followed by the URL defining watch
           line(s).

       *   [ and ] in the above format are there to mark the optional
           parts and should not be typed.

   There are a few special strings which are substituted by uscan to make
   it easy to write the watch file.

   @PACKAGE@
       This is substituted with the source package name found in the first
       line of the debian/changelog file.

   @ANY_VERSION@
       This is substituted by the legal upstream version regex
       (capturing).

         [-_]?(\d[\-+\.:\~\da-zA-Z]*)

   @ARCHIVE_EXT@
       This is substituted by the typical archive file extension regex
       (non-capturing).

         (?i)\.(?:tar\.xz|tar\.bz2|tar\.gz|zip)

   @SIGNATURE_EXT@
       This is substituted by the typical signature file extension regex
       (non-capturing).

         (?i)\.(?:tar\.xz|tar\.bz2|tar\.gz|zip)\.(?:asc|pgp|gpg|sig)

   Some file extensions are not included in the above intentionally to
   avoid false positives.  You can still set such file extension patterns
   manually.

WATCH FILE OPTIONS

   uscan reads the watch options specified in opts=" ... " to customize
   its behavior. Multiple options option1, option2, option3, ... can be
   set as opts="option1, option2, option3,  ...  " .  The double quotes
   are necessary if options contain any spaces.

   Unless otherwise noted as persistent, most options are valid only
   within their containing watch line.

   The available watch options are:

   component=component
       Set the name of the secondary source tarball as
       <spkg>_<oversion>.orig-<component>.tar.gz for a MUT package.

   compression=method
       Set the compression method when the tarball is repacked
       (persistent).

       Available method values are xz, gzip (alias gz), bzip2 (alias bz2),
       and lzma.  The default is gzip for normal tarballs, and xz for
       tarballs generated directly from a git repository.

       If the debian source format is not 1.0, setting this to xz should
       help reduce the package size when the package is repacked.

       Please note the repacking of the upstream tarballs by mk-origtargz
       happens only if one of the following conditions is satisfied:

       *   USCAN_REPACK is set in the devscript configuration.  See
           "DEVSCRIPT CONFIGURATION VARIABLES".

       *   --repack is set on the commandline.  See <COMMANDLINE OPTIONS>.

       *   repack is set in the watch line as opts="repack,...".

       *   The upstream archive is of zip type including jar, xpi, ...

       *   Files-Excluded or Files-Excluded-component stanzas are set in
           debian/copyright to make mk-origtargz invoked from uscan remove
           files from the upstream tarball and repack it.  See "COPYRIGHT
           FILE EXAMPLES" and mk-origtargz(1).

   repack
       Force repacking of the upstream tarball using the compression
       method.

   repacksuffix=suffix
       Add suffix to the Debian package upstream version only when the
       source tarball is repackaged.  This rule should be used only for a
       single upstream tarball package.

   mode=mode
       Set the archive download mode.

       LWP This mode is the default one which downloads the specified
           tarball from the archive URL on the web.

       git This mode accesses the upstream git archive directly with the
           git command and packs the source tree with the specified tag
           into spkg-version.tar.xz.

           If the upstream publishes the released tarball via its web
           interface, please use it instead of using this mode.  This mode
           is the last resort method.

   pgpmode=mode
       Set the PGP/GPG signature verification mode.

       auto
           uscan checks possible URLs for the signature file and
           autogenerates a pgpsigurlmangle rule to use it.

       default
           Use pgpsigurlmangle=rules to generate the candidate upstream
           signature file URL string from the upstream tarball URL.
           (default)

           If the specified pgpsigurlmangle is missing, uscan checks
           possible URLs for the signature file and suggests adding a
           pgpsigurlmangle rule.

       mangle
           Use pgpsigurlmangle=rules to generate the candidate upstream
           signature file URL string from the upstream tarball URL.

       next
           Verify this downloaded tarball file with the signature file
           specified in the next watch line.  The next watch line must be
           pgpmode=previous.  Otherwise, no verification occurs.

       previous
           Verify the downloaded tarball file specified in the previous
           watch line with this signature file.  The previous watch line
           must be pgpmode=next.

       self
           Verify the downloaded file foo.ext with its self signature and
           extract its content tarball file as foo.

       none
           No signature available. (No warning.)

   decompress
       Decompress compressed archive before the pgp/gpg signature
       verification.

   bare
       Disable all site specific special case code such as URL redirector
       uses and page content alterations. (persistent)

   user-agent=user-agent-string
       Set the user-agent string used to contact the HTTP(S) server as
       user-agent-string. (persistent)

       user-agent option should be specified by itself in the watch line
       without URL, to allow using semicolons and commas in it.

   pasv, passive
       Use PASV mode for the FTP connection.

       If PASV mode is required due to the client side network
       environment, set uscan to use PASV mode via "COMMANDLINE OPTIONS"
       or "DEVSCRIPT CONFIGURATION VARIABLES" instead.

   active, nopasv
       Don't use PASV mode for the FTP connection.

   unzipopt=options
       Add the extra options to use with the unzip command, such as -a,
       -aa, and -b, when executed by mk-origtargz.

   dversionmangle=rules
       Normalize the last upstream version string found in
       debian/changelog to compare it to the available upstream tarball
       version.  Removal of the Debian specific suffix such as
       s/\+dfsg\d*$// is usually done here.

   dirversionmangle=rules
       Normalize the directory path string matching the regex in a set of
       parentheses of http://URL as the sortable version index string.
       This is used as the directory path sorting index only.

       Substitution such as s/PRE/~pre/; s/RC/~rc/ may help.

   pagemangle=rules
       Normalize the downloaded web page string.  (Don't use this unless
       this is absolutely needed.  Generally, g flag is required for these
       rules.)

       This is handy if you wish to access Amazon AWS or Subversion
       repositories in which <a href="..."> is not used.

   uversionmangle=rules
       Normalize the candidate upstream version strings extracted from
       hrefs in the source of the web page.  This is used as the version
       sorting index when selecting the latest upstream version.

       Substitution such as s/PRE/~pre/; s/RC/~rc/ may help.

   versionmangle=rules
       Syntactic shorthand for uversionmangle=rules, dversionmangle=rules

   downloadurlmangle=rules
       Convert the selected upstream tarball href string into the
       accessible URL for obfuscated web sites.

   filenamemangle=rules
       Generate the upstream tarball filename from the selected href
       string if matching-pattern can extract the latest upstream version
       <uversion> from the selected href string.  Otherwise, generate the
       upstream tarball filename from its full URL string and set the
       missing <uversion> from the generated upstream tarball filename.

       Without this option, the default upstream tarball filename is
       generated by taking the last component of the URL and removing
       everything after any '?' or '#'.

   pgpsigurlmangle=rules
       Generate the candidate upstream signature file URL string from the
       upstream tarball URL.

   oversionmangle=rules
       Generate the version string <oversion> of the source tarball
       <spkg>_<oversion>.orig.tar.gz from <uversion>.  This should be used
       to add a suffix such as +dfsg1 to a MUT package.

   Here, the mangling rules apply the rules to the pertinent string.
   Multiple rules can be specified in a mangling rule string by making a
   concatenated string of each mangling rule separated by ; (semicolon).

   Each mangling rule cannot contain ; (semicolon), , (comma), or "
   (double quote).

   Each mangling rule behaves as if a Perl command "$string =~ rule" is
   executed.  There are some notable details.

   *   rule may only use the s, tr, and y operations.

       s/regex/replacement/options
           Regex pattern match and replace the target string.  Only the g,
           i and x flags are available.  Use the $1 syntax for back
           references (No \1 syntax).  Code execution is not allowed (i.e.
           no (?{}) or (??{}) constructs).

       y/source/dest/ or tr/source/dest/
           Transliterate the characters in the target string.

EXAMPLE OF EXECUTION

   uscan reads the first entry in debian/changelog to determine the source
   package name and the last upstream version.

   For example, if the first entry of debian/changelog is:

   *   bar (3:2.03+dfsg1-4) unstable; urgency=low

   then, the source package name is bar and the last Debian package
   version is 3:2.03+dfsg1-4.

   The last upstream version is normalized to 2.03+dfsg1 by removing the
   epoch and the Debian revision.

   If the dversionmangle rule exists, the last upstream version is further
   normalized by applying this rule to it.  For example, if the last
   upstream version is 2.03+dfsg1 indicating the source tarball is
   repackaged, the suffix +dfsg1 is removed by the string substitution
   s/\+dfsg\d*$// to make the (dversionmangled) last upstream version 2.03
   and it is compared to the candidate upstream tarball versions such as
   2.03, 2.04, ... found in the remote site.  Thus, set this rule as:

   *   opts="dversionmangle=s/\+dfsg\d*$//"

   uscan downloads a web page from http://URL specified in debian/watch.

   *   If the directory name part of URL has no parentheses, ( and ), it
       is taken as verbatim.

   *   If the directory name part of URL has parentheses, ( and ), then
       uscan recursively searches all possible directories to find a page
       for the newest version.  If the dirversionmangle rule exists, the
       generated sorting index is used to find the newest version.  If a
       specific version is specified for the download, the matching
       version string has priority over the newest version.

   For example, this http://URL may be specified as:

   *   http://www.example.org/([\d\.]+)/

   Please note the trailing / in the above to make ([\d\.]+) as the
   directory.

   If the pagemangle rule exists, the whole downloaded web page as a
   string is normalized by applying this rule to it.  This is very
   powerful tool and needs to be used with caution.  If other mangling
   rules can be used to address your objective, do not use this rule.

   The downloaded web page is scanned for hrefs defined in the <a href="
   ... "> tag to locate the candidate upstream tarball hrefs.  These
   candidate upstream tarball hrefs are matched by the Perl regex pattern
   matching-pattern such as DL-(?:[\d\.]+?)/foo-(.+)\.tar\.gz to narrow
   down the candidates.  This pattern match needs to be anchored at the
   beginning and the end.  For example, candidate hrefs may be:

   *   DL-2.02/foo-2.02.tar.gz

   *   DL-2.03/foo-2.03.tar.gz

   *   DL-2.04/foo-2.04.tar.gz

   Here the matching string of (.+) in matching-pattern is considered as
   the candidate upstream version.  If there are multiple matching strings
   of capturing patterns in matching-pattern, they are all concatenated
   with .  (period) to form the candidate upstream version.  Make sure to
   use the non-capturing regex such as (?:[\d\.]+?) instead for the
   variable text matching part unrelated to the version.

   Then, the candidate upstream versions are:

   *   2.02

   *   2.03

   *   2.04

   The downloaded tarball filename is basically set to the same as the
   filename in the remote URL of the selected href.

   If the uversionmangle rule exists, the candidate upstream versions are
   normalized by applying this rule to them. (This rule may be useful if
   the upstream version scheme doesn't sort correctly to identify the
   newest version.)

   The upstream tarball href corresponding to the newest (uversionmangled)
   candidate upstream version newer than the (dversionmangled) last
   upstream version is selected.

   If multiple upstream tarball hrefs corresponding to a single version
   with different extensions exist, the highest compression one is chosen.
   (Priority: tar.xz > tar.lzma > tar.bz2 > tar.gz.)

   If the selected upstream tarball href is the relative URL, it is
   converted to the absolute URL using the base URL of the web page.  If
   the <base href="  ...  "> tag exists in the web page, the selected
   upstream tarball href is converted to the absolute URL using the
   specified base URL in the base tag, instead.

   If the downloadurlmangle rule exists, the selected upstream tarball
   href is normalized by applying this rule to it. (This is useful for
   some sites with the obfuscated download URL.)

   If the filenamemangle rule exists, the downloaded tarball filename is
   generated by applying this rule to the selected href if matching-
   pattern can extract the latest upstream version <uversion> from the
   selected href string. Otherwise, generate the upstream tarball filename
   from its full URL string and set the missing <uversion> from the
   generated upstream tarball filename.

   Without the filenamemangle rule, the default upstream tarball filename
   is generated by taking the last component of the URL and removing
   everything after any '?' or '#'.

   uscan downloads the selected upstream tarball to the parent ../
   directory.  For example, the downloaded file may be:

   *   ../foo-2.04.tar.gz

   Let's call this downloaded version 2.04 in the above example
   generically as <uversion> in the following.

   If the pgpsigurlmangle rule exists, the upstream signature file URL is
   generated by applying this rule to the (downloadurlmangled) selected
   upstream tarball href and the signature file is tried to be downloaded
   from it.

   If the pgpsigurlmangle rule doesn't exist, uscan warns user if the
   matching upstream signature file is available from the same URL with
   their filename being suffixed by the 4 common suffix asc, gpg, pgp, and
   sig. (You can avoid this warning by setting pgpmode=none.)

   If the signature file is downloaded, the downloaded upstream tarball is
   checked for its authenticity against the downloaded signature file
   using the keyring debian/upstream/signing-key.pgp or the armored
   keyring debian/upstream/signing-key.asc  (see "KEYRING FILE EXAMPLES").
   If its signature is not valid, or not made by one of the listed keys,
   uscan will report an error.

   If the oversionmangle rule exists, the source tarball version oversion
   is generated from the downloaded upstream version uversion by applying
   this rule. This rule is useful to add suffix such as +dfsg1 to the
   version of all the source packages of the MUT package for which the
   repacksuffix mechanism doesn't work.

   uscan invokes mk-origtargz to create the source tarball properly named
   for the source package with .orig. (or .orig-<component>. for the
   secondary tarballs) in its filename.

   case A: packaging of the upstream tarball as is
       mk-origtargz creates a symlink ../bar_<oversion>.orig.tar.gz linked
       to the downloaded local upstream tarball. Here, bar is the source
       package name found in debian/changelog. The generated symlink may
       be:

       *   ../bar_2.04.orig.tar.gz -> foo-2.04.tar.gz (as is)

       Usually, there is no need to set up opts="dversionmangle= ... " for
       this case.

   case B: packaging of the upstream tarball after removing non-DFSG files
       mk-origtargz checks the filename glob of the Files-Excluded stanza
       in the first section of debian/copyright, removes matching files to
       create a repacked upstream tarball.  Normally, the repacked
       upstream tarball is renamed with suffix to
       ../bar_<oversion><suffix>.orig.tar.gz using the repacksuffix option
       for the single upstream package.    Here <oversion> is updated to
       be <oversion><suffix>.

       The removal of files is required if files are not DFSG-compliant.
       For such case, +dfsg1 is used as suffix.

       So the combined options are set as
       opts="dversionmangle=s/\+dfsg\d*$// ,repacksuffix=+dfsg1", instead.

       For example, the repacked upstream tarball may be:

       *   ../bar_2.04+dfsg1.orig.tar.gz (repackaged)

   uscan normally invokes "uupdate --find --upstream-version oversion "
   for the version=4 watch file.

   Please note that --find option is used here since mk-origtargz has been
   invoked to make *.orig.tar.gz file already.  uscan picks bar from
   debian/changelog.

   It creates the new upstream source tree under the ../bar-<oversion>
   directory and Debianize it leveraging the last package contents.

WATCH FILE EXAMPLES

   When writing the watch file, you should rely on the latest upstream
   source announcement web page.  You should not try to second guess the
   upstream archive structure if possible.  Here are the typical
   debian/watch files.

   Please note that executing uscan with -v or -vv reveals what exactly
   happens internally.

   The existence and non-existence of a space the before tailing \ (back
   slash) are significant.

   HTTP site (basic)
   Here is an example for the basic single upstream tarball.

     version=4
     http://example.com/~user/release/foo.html \
         files/foo-([\d\.]+)\.tar\.gz debian uupdate

   Or using the special strings:

     version=4
     http://example.com/~user/release/@PACKAGE@.html \
         files/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian uupdate

   For the upstream source package foo-2.0.tar.gz, this watch file
   downloads and creates the Debian orig.tar file foo_2.0.orig.tar.gz.

   HTTP site (pgpsigurlmangle)
   Here is an example for the basic single upstream tarball with the
   matching signature file in the same file path.

     version=4
     opts="pgpsigurlmangle=s%$%.asc%" http://example.com/release/@PACKAGE@.html \
         files/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian uupdate

   For the upstream source package foo-2.0.tar.gz and the upstream
   signature file foo-2.0.tar.gz.asc, this watch file downloads these
   files, verifies the authenticity using the keyring
   debian/upstream-key.pgp and creates the Debian orig.tar file
   foo_2.0.orig.tar.gz.

   HTTP site (pgpmode=next/previous)
   Here is an example for the basic single upstream tarball with the
   matching signature file in the unrelated file path.

     version=4
     opts="pgpmode=next" http://example.com/release/@PACKAGE@.html \
         files/(?:\d+)/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian
     opts="pgpmode=previous" http://example.com/release/@PACKAGE@.html \
         files/(?:\d+)/@PACKAGE@@ANY_VERSION@@SIGNATURE_EXT@ previous uupdate

   (?:\d+) part can be any random value.  The tarball file can have 53,
   while the signature file can have 33.

   ([\d\.]+) part for the signature file has a strict requirement to match
   that for the upstream tarball specified in the previous line by having
   previous as version in the watch line.

   HTTP site (flexible)
   Here is an example for the maximum flexibility of upstream tarball and
   signature file extensions.

     version=4
     opts="pgpmode=next" http://example.com/DL/ \
         files/(?:\d+)/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian
     opts="pgpmode=previous" http://example.com/DL/ \
         files/(?:\d+)/@PACKAGE@@ANY_VERSION@@SIGNATURE_EXT@ \
         previous uupdate

   HTTP site (basic MUT)
   Here is an example for the basic multiple upstream tarballs.

     version=4
     opts="pgpsigurlmangle=s%$%.sig%" \
         http://example.com/release/foo.html \
         files/foo-([\d\.]+)\.tar\.gz debian
     opts="pgpsigurlmangle=s%$%.sig%, component=bar" \
         http://example.com/release/foo.html \
         files/foobar-([\d\.]+)\.tar\.gz same
     opts="pgpsigurlmangle=s%$%.sig%, component=baz" \
         http://example.com/release/foo.html \
         files/foobaz-([\d\.]+)\.tar\.gz same uupdate

   For the main upstream source package foo-2.0.tar.gz and the secondary
   upstream source packages foobar-2.0.tar.gz and foobaz-2.0.tar.gz which
   install under bar/ and baz/, this watch file downloads and creates the
   Debian orig.tar file foo_2.0.orig.tar.gz, foo_2.0.orig-bar.tar.gz and
   foo_2.0.orig-baz.tar.gz.  Also, these upstream tarballs are verified by
   their signature files.

   HTTP site (recursive directory scanning)
   Here is an example with the recursive directory scanning for the
   upstream tarball and its signature files released in a directory named
   after their version.

     version=4
     opts="pgpsigurlmangle=s%$%.sig%, dirversionmangle=s/-PRE/~pre/;s/-RC/~rc/" \
         http://tmrc.mit.edu/mirror/twisted/Twisted/([\d+\.]+)/ \
         Twisted-([\d\.]+)\.tar\.xz debian uupdate

   Here, the web site should be accessible at the following URL:

     http://tmrc.mit.edu/mirror/twisted/Twisted/

   Here, dirversionmangle option is used to normalize the sorting order of
   the directory names.

   HTTP site (alternative shorthand)
   For the bare HTTP site where you can directly see archive filenames,
   the normal watch file:

     version=4
     opts="pgpsigurlmangle=s%$%.sig%" \
         http://www.cpan.org/modules/by-module/Text/ \
         Text-CSV_XS-(.+)\.tar\.gz \
         debian uupdate

   can be rewritten in an alternative shorthand form:

     version=4
     opts="pgpsigurlmangle=s%$%.sig%" \
         http://www.cpan.org/modules/by-module/Text/\
         Text-CSV_XS-(.+)\.tar\.gz \
         debian uupdate

   Please note that matching-pattern of the first example doesn't have
   directory and the subtle difference of a space before the tailing \.

   HTTP site (funny version)
   For a site which has funny version numbers, the parenthesized groups
   will be joined with . (period) to make a sanitized version number.

     version=4
     http://www.site.com/pub/foobar/foobar_v(\d+)_(\d+)\.tar\.gz \
     debian uupdate

   HTTP site (DFSG)
   The upstream part of the Debian version number can be mangled to
   indicate the source package was repackaged to clean up non-DFSG files:

     version=4
     opts="dversionmangle=s/\+dfsg\d*$//,repacksuffix=+dfsg1" \
     http://some.site.org/some/path/foobar-(.+)\.tar\.gz debian uupdate

   See "COPYRIGHT FILE EXAMPLES".

   HTTP site (filenamemangle)
   The upstream tarball filename is found by taking the last component of
   the URL and removing everything after any '?' or '#'.

   If this does not fit to you, use filenamemangle.  For example, <A
   href="http://foo.bar.org/dl/?path=&dl=foo-0.1.1.tar.gz"> could be
   handled as:

     version=4
     opts=filenamemangle=s/.*=(.*)/$1/ \
     http://foo.bar.org/dl/\?path=&dl=foo-(.+)\.tar\.gz \
     debian uupdate

   <A href="http://foo.bar.org/dl/?path=&dl_version=0.1.1"> could be
   handled as:

     version=4
     opts=filenamemangle=s/.*=(.*)/foo-$1\.tar\.gz/ \
     http://foo.bar.org/dl/\?path=&dl_version=(.+) \
     debian uupdate

   If the href string has no version using <I>matching-pattern>, the
   version can be obtained from the full URL using filenamemangle.

     version=4
     opts=filenamemangle=s&.*/dl/(.*)/foo\.tar\.gz&foo-$1\.tar\.gz& \
     http://foo.bar.org/dl/([\.\d]+)/ foo.tar.gz \
     debian uupdate

   HTTP site (downloadurlmangle)
   The option downloadurlmangle can be used to mangle the URL of the file
   to download.  This can only be used with http:// URLs.  This may be
   necessary if the link given on the web page needs to be transformed in
   some way into one which will work automatically, for example:

     version=4
     opts=downloadurlmangle=s/prdownload/download/ \
     http://developer.berlios.de/project/showfiles.php?group_id=2051 \
     http://prdownload.berlios.de/softdevice/vdr-softdevice-(.+).tgz \
     debian uupdate

   HTTP site (oversionmangle, MUT)
   The option oversionmangle can be used to mangle the version of the
   source tarball (.orig.tar.gz and .orig-bar.tar.gz).  For example,
   +dfsg1 can be added to the upstream version as:

     version=4
     opts=oversionmangle=s/(.*)/$1+dfsg1/ \
     http://example.com/~user/release/foo.html \
     files/foo-([\d\.]*).tar.gz debian
     opts="component=bar" \
     http://example.com/~user/release/foo.html \
     files/bar-([\d\.]*).tar.gz same uupdate

   See "COPYRIGHT FILE EXAMPLES".

   HTTP site (pagemangle)
   The option pagemangle can be used to mangle the downloaded web page
   before applying other rules.  The non-standard web page without proper
   <a href=" << ... >> "> entries can be converted.  For example, if
   foo.html uses <a bogus=" ... ">, this can be converted to the standard
   page format with:

     version=4
     opts=pagemangle="s/<a\s+bogus=/<a href=/g" \
     http://example.com/release/@PACKAGE@.html \
     files/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian uupdate

   Please note the use of g here to replace all occurrences.

   If foo.html uses <Key> ... </Key>, this can be converted to the
   standard page format with:

     version=4
     opts="pagemangle=s%<Key>([^<]*)</Key>%<Key><a href="$1">$1</a></Key>%g" \\
     http://localhost:$PORT/ \
     (?:.*)/@PACKAGE@@ANY_VERSION@@ARCHIVE_EXT@ debian uupdate

   FTP site (basic):
     version=4
     ftp://ftp.tex.ac.uk/tex-archive/web/c_cpp/cweb/cweb-(.+)\.tar\.gz \
     debian uupdate

   FTP site (regex special characters):
     version=4
     ftp://ftp.worldforge.org/pub/worldforge/libs/\
     Atlas-C++/transitional/Atlas-C\+\+-(.+)\.tar\.gz debian uupdate

   Please note that this URL is connected to be  ... libs/Atlas-C++/ ...
   . For ++, the first one in the directory path is verbatim while the one
   in the filename is escaped by \.

   FTP site (funny version)
   This is another way of handling site with funny version numbers, this
   time using mangling.  (Note that multiple groups will be concatenated
   before mangling is performed, and that mangling will only be performed
   on the basename version number, not any path version numbers.)

     version=4
     opts="uversionmangle=s/^/0.0./" \
     ftp://ftp.ibiblio.org/pub/Linux/ALPHA/wine/\
     development/Wine-(.+)\.tar\.gz debian uupdate

   sf.net
   For SourceForge based projects, qa.debian.org runs a redirector which
   allows a simpler form of URL. The format below will automatically be
   rewritten to use the redirector with the watch file:

     version=4
     http://sf.net/<project>/ <tar-name>-(.+)\.tar\.gz debian uupdate

   For audacity, set the watch file as:

     version=4
     http://sf.net/audacity/ audacity-minsrc-(.+)\.tar\.gz debian uupdate

   Please note, you can still use normal functionalities of uscan to set
   up a watch file for this site without using the redirector.

     version=4
     opts="uversionmangle=s/-pre/~pre/, \
           filenamemangle=s%(?:.*)audacity-minsrc-(.+)\.tar\.xz/download%\
                            audacity-$1.tar.xz%" \
           http://sourceforge.net/projects/audacity/files/audacity/(\d[\d\.]+)/ \
           (?:.*)audacity-minsrc-([\d\.]+)\.tar\.xz/download debian uupdate

   Here, % is used as the separator instead of the standard /.

   github.com
   For GitHub based projects, you can use the tags or releases page.  The
   archive URL uses only the version as the filename.  You can rename the
   downloaded upstream tarball from into the standard
   <project>-<version>.tar.gz using filenamemangle:

     version=4
     opts="filenamemangle=s%(?:.*?)?v?(\d[\d.]*)\.tar\.gz%<project>-$1.tar.gz%" \
         https://github.com/<user>/<project>/tags \
         (?:.*?/)?v?(\d[\d.]*)\.tar\.gz debian uupdate

   PyPI
   For PyPI based projects, pypi.debian.net runs a redirector which allows
   a simpler form of URL. The format below will automatically be rewritten
   to use the redirector with the watch file:

     version=4
     https://pypi.python.org/packages/source/<initial>/<project>/ \
         <tar-name>-(.+)\.tar\.gz debian uupdate

   For cfn-sphere, set the watch file as:

     version=4
     https://pypi.python.org/packages/source/c/cfn-sphere/ \
         cfn-sphere-([\d\.]+).tar.gz debian uupdate

   Please note, you can still use normal functionalities of uscan to set
   up a watch file for this site without using the redirector.

     version=4
     opts="pgpmode=none" \
         https://pypi.python.org/pypi/cfn-sphere/ \
         https://pypi.python.org/packages/source/c/cfn-sphere/\
         cfn-sphere-([\d\.]+).tar.gz#.* debian uupdate

   code.google.com
   Sites which used to be hosted on the Google Code service should have
   migrated to elsewhere (github?).  Please look for the newer upstream
   site.

   direct access to the git repository
   If the upstream only publishes its code via the git repository and it
   has no web interface to obtain the release tarball, you can use uscan
   with the tags of the git repository.

     version=4
     opts="mode=git, pgpmode=none" \
     http://git.ao2.it/tweeper.git \
     refs/tags/v([\d\.]+) debian uupdate

   Please note "git ls-remote" is used to obtain references for tags.  If
   a tag v20.5 is the newest tag, the above example downloads
   spkg-20.5.tar.xz.

COPYRIGHT FILE EXAMPLES

   Here is an example for the debian/copyright file which initiates
   automatic repackaging of the upstream tarball into
   <spkg>_<oversion>.orig.tar.gz (In debian/copyright, the Files-Excluded
   and Files-Excluded-component stanzas are a part of the first paragraph
   and there is a blank line before the following paragraphs which contain
   Files and other stanzas.):

     Format: http://www.debian.org/doc/packaging-manuals/copyright-format/1.0/
     Files-Excluded: exclude-this
      exclude-dir
      */exclude-dir
      .*
      */js/jquery.js

      Files: *
      Copyright: ...
      ...

   Here is another example for the debian/copyright file which initiates
   automatic repackaging of the multiple upstream tarballs into
   <spkg>_<oversion>.orig.tar.gz and <spkg>_<oversion>.orig-bar.tar.gz:

     Format: http://www.debian.org/doc/packaging-manuals/copyright-format/1.0/
     Files-Excluded: exclude-this
      exclude-dir
      */exclude-dir
      .*
      */js/jquery.js
     Files-Excluded-bar: exclude-this
      exclude-dir
      */exclude-dir
      .*
      */js/jquery.js

      Files: *
      Copyright: ...
      ...

   See mk-origtargz(1).

KEYRING FILE EXAMPLES

   Let's assume that the upstream "uscan test key (no secret)
   <none@debian.org>" signs its package and publishes its public key
   fingerprint 'CF21 8F0E 7EAB F584 B7E2 0402 C77E 2D68 7254 3FAF' which
   you know is the trusted one.

   Please note that the short keyid 72543FAF is the last 4 Bytes, the long
   keyid C77E2D6872543FAF is the last 8 Bytes, and the finger print is the
   last 20 Bytes of the public key in hexadecimal form.  You can save
   typing by using the short keyid but you must verify the OpenPGP key
   using its fingerprint.

   The armored keyring file debian/upstream/signing-key.asc can be created
   by using the gpg (or gpg2) command as follows.

     $ gpg --recv-keys "72543FAF"
     ...
     $ gpg --finger "72543FAF"
     pub   4096R/72543FAF 2015-09-02
           Key fingerprint = CF21 8F0E 7EAB F584 B7E2  0402 C77E 2D68 7254 3FAF
     uid                  uscan test key (no secret) <none@debian.org>
     sub   4096R/52C6ED39 2015-09-02
     $ cd path/to/<upkg>-<uversion>
     $ mkdir -p debian/upstream
     $ gpg --export --export-options export-minimal --armor \
           'CF21 8F0E 7EAB F584 B7E2  0402 C77E 2D68 7254 3FAF' \
           >debian/upstream/signing-key.asc

   The binary keyring file can be created instead by skipping --armor and
   changing the storing file to debian/upstream/signing-key.pgp in the
   above example.  If a group of developers sign the package, you need to
   list fingerprints of all of them in the argument for gpg --export ...
   to make the keyring to contain all OpenPGP keys of them.

   Sometimes you may wonder who made a signature file.  You can get the
   public keyid used to create the detached signature file
   foo-2.0.tar.gz.asc by running gpg as:

     $ gpg -vv foo-2.0.tar.gz.asc
     gpg: armor: BEGIN PGP SIGNATURE
     gpg: armor header: Version: GnuPG v1
     :signature packet: algo 1, keyid C77E2D6872543FAF
           version 4, created 1445177469, md5len 0, sigclass 0x00
           digest algo 2, begin of digest 7a c7
           hashed subpkt 2 len 4 (sig created 2015-10-18)
           subpkt 16 len 8 (issuer key ID C77E2D6872543FAF)
           data: [4091 bits]
     gpg: assuming signed data in `foo-2.0.tar.gz'
     gpg: Signature made Sun 18 Oct 2015 11:11:09 PM JST using RSA key ID 72543FAF
     ...

COMMANDLINE OPTIONS

   For the basic usage, uscan does not require to set these options.

   --no-conf, --noconf
       Don't read any configuration files. This can only be used as the
       first option given on the command-line.

   --no-verbose
       Don't report verbose information. (default)

   --verbose, -v
       Report verbose information.

   --debug, -vv
       Report verbose information including the downloaded web pages as
       processed to STDERR for debugging.

   --dehs
       Send DEHS style output (XML-type) to STDOUT, while send all other
       uscan output to STDERR.

   --no-dehs
       Use only traditional uscan output format. (default)

   --download, -d
       Download the new upstream release. (default)

   --force-download, -dd
       Download the new upstream release even if up-to-date. (may not
       overwrite the local file)

   --overwrite-download, -ddd
       Download the new upstream release even if up-to-date. (may
       overwrite the local file)

   --no-download, --nodownload
       Don't download and report information.

       Previously downloaded tarballs may be used.

       Change default to --skip-signature.

   --signature
       Download signature. (default)

   --no-signature
       Don't download signature but verify if already downloaded.

   --skip-signature
       Don't bother download signature nor verifying signature.

   --safe, --report
       Avoid running unsafe scripts by skipping both the repacking of the
       downloaded package and the updating of the new source tree.

       Change default to --no-download and --skip-signature.

       When the objective of running uscan is to gather the upstream
       package status under the security conscious environment, please
       make sure to use this option.

   --report-status
       This is equivalent of setting "--verbose --safe".

   --download-version version
       Specify the version which the upstream release must match in order
       to be considered, rather than using the release with the highest
       version.  (a best effort feature)

   --download-debversion version
       Specify the Debian package version to download the corresponding
       upstream release version.  The dversionmangle and uversionmangle
       rules are considered.  (a best effort feature)

   --download-current-version
       Download the currently packaged version.  (a best effort feature)

   --check-dirname-level N
       See the below section "Directory name checking" for an explanation
       of this option.

   --check-dirname-regex regex
       See the below section "Directory name checking" for an explanation
       of this option.

   --destdir
       Set the path of directory to which to download instead of its
       default ../.  If the specified path is not absolute, it will be
       relative to one of the current directory or, if directory scanning
       is enabled, the package's source directory.

   --package package
       Specify the name of the package to check for rather than examining
       debian/changelog; this requires the --upstream-version (unless a
       version is specified in the watch file) and --watchfile options as
       well.  Furthermore, no directory scanning will be done and nothing
       will be downloaded.  This option automatically sets --no-download
       and --skip-signature; and probably most useful in conjunction with
       the DEHS system (and --dehs).

   --upstream-version upstream-version
       Specify the current upstream version rather than examine
       debian/watch or debian/changelog to determine it. This is ignored
       if a directory scan is being performed and more than one
       debian/watch file is found.

   --watchfile watchfile
       Specify the watchfile rather than perform a directory scan to
       determine it. If this option is used without --package, then uscan
       must be called from within the Debian package source tree (so that
       debian/changelog can be found simply by stepping up through the
       tree).

   --bare
       Disable all site specific special case codes to perform URL
       redirections and page content alterations.

   --no-exclusion
       Don't automatically exclude files mentioned in debian/copyright
       field Files-Excluded.

   --pasv
       Force PASV mode for FTP connections.

   --no-pasv
       Don't use PASV mode for FTP connections.

   --no-symlink
       Don't rename nor repack upstream tarball.

   --timeout N
       Set timeout to N seconds (default 20 seconds).

   --user-agent, --useragent
       Override the default user agent header.

   --help
       Give brief usage information.

   --version
       Display version information.

   uscan also accepts following options and passes them to mk-origtargz:

   --symlink
       Make orig.tar.gz (with the appropriate extension) symlink to the
       downloaded files. (This is the default behavior.)

   --copy
       Instead of symlinking as described above, copy the downloaded
       files.

   --rename
       Instead of symlinking as described above, rename the downloaded
       files.

   --repack
       After having downloaded an lzma tar, xz tar, bzip tar, gz tar, zip,
       jar, xpi archive, repack it to the specified compression (see
       --compression).

       The unzip package must be installed in order to repack zip and jar
       archives, the mozilla-devscripts package must be installed to
       repack xpi archives, and the xz-utils package must be installed to
       repack lzma or xz tar archives.

   --compression [ gzip | bzip2 | lzma | xz ]
       In the case where the upstream sources are repacked (either because
       --repack option is given or debian/copyright contains the field
       Files-Excluded), it is possible to control the compression method
       via the parameter.  The default is gzip for normal tarballs, and xz
       for tarballs generated directly from the git repository.

   --copyright-file copyright-file
       Exclude files mentioned in Files-Excluded in the given copyright-
       file.  This is useful when running uscan not within a source
       package directory.

DEVSCRIPT CONFIGURATION VARIABLES

   For the basic usage, uscan does not require to set these configuration
   variables.

   The two configuration files /etc/devscripts.conf and ~/.devscripts are
   sourced by a shell in that order to set configuration variables. These
   may be overridden by command line options. Environment variable
   settings are ignored for this purpose. If the first command line option
   given is --noconf, then these files will not be read. The currently
   recognized variables are:

   USCAN_DOWNLOAD
       If this is set to no, then newer upstream files will not be
       downloaded; this is equivalent to the --no-download options.

   USCAN_SAFE
       If this is set to yes, then uscan avoids running unsafe scripts by
       skipping both the repacking of the downloaded package and the
       updating of the new source tree; this is equivalent to the --safe
       options; this also sets the default to --no-download and
       --skip-signature.

   USCAN_PASV
       If this is set to yes or no, this will force FTP connections to use
       PASV mode or not to, respectively. If this is set to default, then
       Net::FTP(3) makes the choice (primarily based on the FTP_PASSIVE
       environment variable).

   USCAN_TIMEOUT
       If set to a number N, then set the timeout to N seconds. This is
       equivalent to the --timeout option.

   USCAN_SYMLINK
       If this is set to no, then a pkg_version.orig.tar.{gz|bz2|lzma|xz}
       symlink will not be made (equivalent to the --no-symlink option).
       If it is set to yes or symlink, then the symlinks will be made. If
       it is set to rename, then the files are renamed (equivalent to the
       --rename option).

   USCAN_DEHS_OUTPUT
       If this is set to yes, then DEHS-style output will be used. This is
       equivalent to the --dehs option.

   USCAN_VERBOSE
       If this is set to yes, then verbose output will be given.  This is
       equivalent to the --verbose option.

   USCAN_USER_AGENT
       If set, the specified user agent string will be used in place of
       the default.  This is equivalent to the --user-agent option.

   USCAN_DESTDIR
       If set, the downloaded files will be placed in this  directory.
       This is equivalent to the --destdir option.

   USCAN_REPACK
       If this is set to yes, then after having downloaded a bzip tar,
       lzma tar, xz tar, or zip archive, uscan will repack it to the
       specified compression (see --compression). This is equivalent to
       the --repack option.

   USCAN_EXCLUSION
       If this is set to no, files mentioned in the field Files-Excluded
       of debian/copyright will be ignored and no exclusion of files will
       be tried.  This is equivalent to the --no-exclusion option.

EXIT STATUS

   The exit status gives some indication of whether a newer version was
   found or not; one is advised to read the output to determine exactly
   what happened and whether there were any warnings to be noted.

   0   Either --help or --version was used, or for some watch file which
       was examined, a newer upstream version was located.

   1   No newer upstream versions were located for any of the watch files
       examined.

ADVANCED FEATURES

   uscan has many other enhanced features which are skipped in the above
   section for the simplicity.  Let's check their highlights.

   uscan actually scans not just the current directory but all its
   subdirectories looking for debian/watch to process them all.  See
   "Directory name checking".

   uscan can be executed with path as its argument to change the starting
   directory of search from the current directory to path .

   See "COMMANDLINE OPTIONS" and "DEVSCRIPT CONFIGURATION VARIABLES" for
   other variations.

   Custom script
   The optional script parameter in debian/watch means to execute script
   with options after processing this line if specified.

   For compatibility with other tools such as git-buildpackage, it may not
   be wise to create custom scripts with random behavior.  In general,
   uupdate is the best choice for the non-native package and custom
   scripts, if created, should behave as if uupdate.  For possible use
   case, see <http://bugs.debian.org/748474> as an example.

   URL diversion
   Some popular web sites changed their web page structure causing
   maintenance problems to the watch file.  There are some redirection
   services created to ease maintenance of the watch file.  Currently,
   uscan makes automatic diversion of URL requests to the following URLs
   to cope with this situation.

   *   <http://sf.net>

   *   <http://pypi.python.org>

   Directory name checking
   Similarly to several other scripts in the devscripts package, uscan
   explores the requested directory trees looking for debian/changelog and
   debian/watch files. As a safeguard against stray files causing
   potential problems, and in order to promote efficiency, it will examine
   the name of the parent directory once it finds the debian/changelog
   file, and check that the directory name corresponds to the package
   name. It will only attempt to download newer versions of the package
   and then perform any requested action if the directory name matches the
   package name. Precisely how it does this is controlled by two
   configuration file variables DEVSCRIPTS_CHECK_DIRNAME_LEVEL and
   DEVSCRIPTS_CHECK_DIRNAME_REGEX, and their corresponding command-line
   options --check-dirname-level and --check-dirname-regex.

   DEVSCRIPTS_CHECK_DIRNAME_LEVEL can take the following values:

   0   Never check the directory name.

   1   Only check the directory name if we have had to change directory in
       our search for debian/changelog, that is, the directory containing
       debian/changelog is not the directory from which uscan was invoked.
       This is the default behavior.

   2   Always check the directory name.

   The directory name is checked by testing whether the current directory
   name (as determined by pwd(1)) matches the regex given by the
   configuration file option DEVSCRIPTS_CHECK_DIRNAME_REGEX or by the
   command line option --check-dirname-regex regex. Here regex is a Perl
   regex (see perlre(3perl)), which will be anchored at the beginning and
   the end. If regex contains a /, then it must match the full directory
   path. If not, then it must match the full directory name. If regex
   contains the string package, this will be replaced by the source
   package name, as determined from the debian/changelog. The default
   value for the regex is: package(-.+)?, thus matching directory names
   such as package and package-version.

HISTORY AND UPGRADING

   This section briefly describes the backwards-incompatible watch file
   features which have been added in each watch file version, and the
   first version of the devscripts package which understood them.

   Pre-version 2
       The watch file syntax was significantly different in those days.
       Don't use it.  If you are upgrading from a pre-version 2 watch
       file, you are advised to read this manpage and to start from
       scratch.

   Version 2
       devscripts version 2.6.90: The first incarnation of the current
       style of watch files.

   Version 3
       devscripts version 2.8.12: Introduced the following: correct
       handling of regex special characters in the path part,
       directory/path pattern matching, version number in several parts,
       version number mangling. Later versions have also introduced URL
       mangling.

       If you are upgrading from version 2, the key incompatibility is if
       you have multiple groups in the pattern part; whereas only the
       first one would be used in version 2, they will all be used in
       version 3. To avoid this behavior, change the non-version-number
       groups to be (?:  ... ) instead of a plain (  ...  ) group.

       *   uscan invokes the custom script as "script --upstream-version
           version ../spkg_version.orig.tar.gz".

       *   uscan invokes the standard uupdate as "uupdate --no-symlink
           --upstream-version version ../spkg_version.orig.tar.gz".

   Version 4
       devscripts version 2.15.10: The first incarnation of watch files
       supporting multiple upstream tarballs.

       The syntax of the watch file is relaxed to allow more spaces for
       readability.

       If you have a custom script in place of uupdate, you may also
       encounter problems updating from Version 3.

       *   uscan invokes the custom script as "script --upstream-version
           version".

       *   uscan invokes the standard uupdate as "uupdate --find
           --upstream-version version".

       Restriction for --dehs is lifted by redirecting other output to
       STDERR when it is activated.

SEE ALSO

   dpkg(1), mk-origtargz(1), perlre(1), uupdate(1), devscripts.conf(5)

AUTHOR

   The original version of uscan was written by Christoph Lameter
   <clameter@debian.org>. Significant improvements, changes and bugfixes
   were made by Julian Gilbey <jdg@debian.org>. HTTP support was added by
   Piotr Roszatycki <dexter@debian.org>. The program was rewritten in Perl
   by Julian Gilbey.




Free and Open Source Software


Free Software Video

Useful Programs

Free Online Courses

Open Opportunity

Open Business