rwgeoip2ccmap - Create a country code prefix map from a GeoIP Legacy file
rwgeoip2ccmap [--input-path=PATH] [--output-path=PATH] [--dry-run]
[--mode={[auto] [ipv4|ipv6] [csv|binary] [geoip2|legacy]}]
[--fields=FIELDS] [--note-add=TEXT] [--note-file-add=FILENAME]
[--invocation-strip]
rwgeoip2ccmap --help
rwgeoip2ccmap --version
Legacy Synopsis
rwgeoip2ccmap {--csv-input | --v6-csv-input | --encoded-input}
[--input-file=PATH] [--output-file=PATH] [--dry-run]
[--note-add=TEXT] [--note-file-add=FILENAME]
[--invocation-strip]
Prefix maps provide a way to map field values to string labels based on a user-defined map file. The country code prefix map, typically named country_codes.pmap, is a special prefix map that maps an IP address to a two-letter country code as defined by ISO 3166 part 1. For additional information, see https://www.iso.org/iso-3166-country-codes.html and https://en.wikipedia.org/wiki/ISO_3166-1_alpha-2.
rwgeoip2ccmap creates the country code prefix map by reading one of the country code database files distributed by MaxMind(R) http://www.maxmind.com/. rwgeoip2ccmap supports these formats:
GeoIP2 or GeoLite2 Country comma-separated value (CSV) files. Set --input-path to the name of the directory containing the files GeoIP2-Country-Blocks-IPv4.csv, GeoIP2-Country-Blocks-IPv6.csv, and GeoIP2-Country-Locations-en.csv (or the GeoLite2-*.csv versions of those files). Since SiLK 3.17.0.
GeoIP2 or GeoLite2 Country binary file. Set --input-path to the GeoIP2-Country.mmdb of GeoLite2-Country.mmdb file. Note: This requires that SiLK was compiled with support for the libmaxminddb
library. Since SiLK 3.17.0.
GeoIP or GeoLite Legacy Country Code binary file, either IPv4 or IPv6. Set --input-path to GeoIP.dat or GeoIPv6.dat. You may also pipe or redirect the file into rwgeoip2ccmap's standard input.
GeoIP or GeoLite Legacy Country Code comma-separated value (CSV) file, either IPv4 or IPv6. Set --input-path to GeoIPCountryWhois.csv or GeoIPv6.csv. You may also pipe or redirect the file into rwgeoip2ccmap's standard input.
The GeoIP2 and GeoLite2 files provide up to three GeoName codes for each network block, where the GeoName may represent a country (and its continent) or only a continent.
The country where the network is located.
The country in which the ISP has registered the network.
The country that is represented by users of the network (consider an overseas military base).
As of SiLK 3.17.2, the --fields switch allows you to select the order in which these values are checked.
See the "EXAMPLES" section below for the details on how to convert these files to a SiLK country-code prefix map file.
The country code prefix map file is used to map IP addresses to country codes in various SiLK tools as documented in the ccfilter(3) man page. As a brief overview, you may
partition by an IP address's country code in rwfilter(1)
display an IP address's country code in rwcut(1)
sort by an IP address's country code in rwsort(1)
bin by an IP address's country code in rwstats(1), rwuniq(1), and rwgroup(1).
Use rwpmapcat(1) with the --country-codes switch to print the contents of a country code prefix map.
The rwpmaplookup(1) command can use the country code mapping file to display the country code for textual IP addresses.
To create a general prefix map file from textual input, use rwpmapbuild(1).
Option names may be abbreviated if the abbreviation is unique or is an exact match for an option. A parameter to an option may be specified as --arg=param or --arg param, though the first form is required for options that take optional parameters.
Specify the type of the input and whether rwgeoip2ccmap creates a prefix map containing IPv4 or IPv6 addresses. MODE_OPTIONS is a comma-separated list of the following values. When not specified, MODE_OPTIONS defaults to auto. Since SiLK 3.12.0; changed in SiLK 3.17.0.
Determine the type of the input based on the argument to --input-path, and determine the type of prefix map to create based on the IP addresses that appear on the first line of input for a CSV file or by the depth of the tree for a binary input file. This is the default mode.
Create an IPv6 prefix map. When reading CSV input, the IPv4 addresses are mapped into the ::ffff:0:0/96 netblock. This value may not be combined with ipv4.
Create an IPv4 prefix map. When reading CSV input, the IPv6 addresses in the ::ffff:0:0/96 netblock are mapped to IPv4 addresses and all other IPv6 addresses are ignored. When reading GeoIP2 binary data, the IPv6 addresses in the ::0:0/96 netblock are mapped to IPv4. This value may not be combined with ipv6.
Read textual input containing IP addresses in a comma separated value format. and create an IPv4 prefix map. Any IPv6 addresses in the ::ffff:0:0/96 netblock are mapped to an IPv4 address and all other IPv6 addresses are ignored. This value may not be combined with binary. Since SiLK 3.17.0.
Read a MaxMind binary country code database file in the GeoIP Legacy, GeoIP2, or GeoLite2 formats. Support for the GeoIP2 formats requires that SiLK was built with libmaxminddb support. This value may not be combined with csv.
Expect the input to be the GeoIP2 or GeoLite2 country code formats (CSV or binary). GeoIP2/GeoLite2 data may not be read from the standard input. The value may not be combined with legacy. Since SiLK 3.17.0.
Expect the input to be the GeoIP Legacy country code format (CSV or binary). This mode is enabled if the input is being read from the standard input. This value may not be combined with geoip2. Since SiLK 3.17.0.
Read the comma-separated value (CSV) or binary forms of the GeoIP2, GeoLite2, GeoIP Legacy, or GeoLite Legacy country code database from PATH. For GeoIP2 data, the --input-path switch is required, and it must either be the location of the GeoLite2-Country.mmdb file for binary data or the directory containing the GeoLite2-Country-Blocks-IPv4.csv file for CSV data. rwgeoip2ccmap supports reading GeoIP Legacy data (either binary or CSV) from the standard input. You may use stdin
or -
to represent the standard input; when this switch is not provided, the input is read from the standard input unless the standard input is a terminal. rwgeoip2ccmap reads read textual input from the terminal if the standard input is explicitly specified as the input. (Added in SiLK 3.17.0 as a replacement for --input-file.)
Write the binary country code prefix map to PATH. You may use stdout
or -
to represent the standard output. When this switch is not provided, the prefix map is written to the standard output unless the standard output is connected to a terminal. (Added in SiLK 3.17.0 as a replacement for --output-file.)
Check the syntax of the input file and do not write the output file. Since SiLK 3.12.0.
Select which of the GeoName fields are used when processing a GeoIP2 or GeoLite2 file, given that these files provide up to three GeoName values for each IP block, some GeoName values map to a continent but not a specific country, and some blocks are flagged as being by an anonymizing proxy or a satellite provider. (For details on the content of the files, see https://dev.maxmind.com/geoip/geoip2/geoip2-city-country-csv-databases/.)
FIELDS is a comma-separated list of one or more of the following values. rwgeoip2ccmap checks each value and stops when it finds one that is non-empty. If all are empty, no mapping is added for the IP block. When the switch not given, the default is "location, registered, represented, continent, flags". Since SiLK 3.17.2.
The supported field values and their mapping to the fields in the GeoIP2 files are:
The country where the IP address block is located. (geoname_id)
The country in which the ISP has registered the IP address block. (registered_country_geoname_id)
The country that is represented by users of the IP address block (consider an overseas military base). (represented_country_geoname_id)
Whether the IP is marked as being used by an anonymizing proxy or a satellite provider. (is_anonymous_proxy, is_satellite_provider)
For binary GeoIP2 files, the continent code. For CSV GeoIP2 files, if this appears before location
, registered
, and represented
, rwgeoip2ccmap uses the first of those fields that is non-empty and maps to either a country or a continent. If this appears after those fields, rwgeoip2ccmap uses the first non-empty field that maps to a country and only when none map to a country does rwgeoip2ccmap check those fields for a continent code.
Add the specified TEXT to the header of the output file as an annotation. This switch may be repeated to add multiple annotations to a file. To view the annotations, use the rwfileinfo(1) tool. Since SiLK 3.12.0.
Open FILENAME and add the contents of that file to the header of the output file as an annotation. This switch may be repeated to add multiple annotations. Currently the application makes no effort to ensure that FILENAME contains text; be careful that you do not attempt to add a SiLK data file as an annotation. Since SiLK 3.12.0.
Do not record the command used to create the prefix map in the output. When this switch is not given, the invocation is written to the file's header, and the invocation may be viewed with rwfileinfo(1). Since SiLK 3.12.0.
Print the available options and exit.
Print the version number and exit the application.
The following switches are deprecated.
Assume the input is the CSV GeoIP Legacy country code data for IPv4. Use --mode=ipv4,csv,legacy as the replacement. Deprecated as of SiLK 3.12.0.
Assume the input is the CSV GeoIP Legacy country code data for IPv6. Use --mode=ipv6,csv,legacy as the replacement. Deprecated as of SiLK 3.12.0.
Assume the input is the specially-encoded binary form of the GeoIP Legacy country code data for either IPv4 or IPv6. Use --mode=binary,legacy as the replacement. Deprecated as of SiLK 3.12.0.
Read the input from PATH. An alias for --input-path. Added in SiLK 3.12.0; deprecated as of SiLK 3.17.0.
Write the binary country code prefix map to PATH. An alias for --output-path. Added in SiLK 3.12.0; deprecated as of SiLK 3.17.0.
The following examples show how to create the country code prefix map file, country_codes.pmap, from various forms of input. Once you have created the country_codes.pmap file, you should copy it to /usr/share/silk/country_codes.pmap so that the ccfilter(3) plug-in can find it. Alternatively, you can set the SILK_COUNTRY_CODES environment variable to the location of the country_codes.pmap file.
In these examples, the dollar sign ($) represents the shell prompt. Some input lines are split over multiple lines in order to improve readability, and a backslash (\) is used to indicate such lines.
Download the CSV version of the MaxMind GeoIP2 or GeoLite2 country database file, e.g., GeoLite2-Country-CSV_20180327.zip. This archive is created with the zip(1) utility and contains a directory of multiple files. Expand the archive with unzip(1):
$ unzip GeoLite2-Country-CSV_20180327.zip
Archive: GeoLite2-Country-CSV_20180327.zip
inflating: GeoLite2-Country-CSV_20180327/GeoLite2-Country-...
...
rwgeoip2ccmap uses three of those files:
A mapping from IPv4 netblocks to a geoname_id
s.
A mapping from IPv6 netblocks to a geoname_id
s.
A mapping from geoname_id
s to continent and country.
Run rwgeoip2ccmap and set --input-path to the name of the directory.
$ rwgeoip2ccmap --input-path=GeoLite2-Country-CSV_20180327 \
--output-path=country_codes.pmap
Support for reading GeoIP2 binary files requires that rwgeoip2ccmap was compiled with support for the libmaxminddb
library.
Download the binary version of the MaxMind GeoIP2 or GeoLite2 country database file, e.g., GeoLite2-Country_20180327.tar.gz. The file is a compressed (gzip(1)) tape archive (tar(1)). Most versions of the tar program allow you to expand the archive using
$ tar zxf GeoLite2-Country_20180327.tar.gz
Older versions of tar may require you to invoke gzip yourself
$ gzip -d -c GeoLite2-Country_20180327.tar.gz \
| tar cf -
The result is a directory named GeoLite2-Country_20180327 or similar.
Run rwgeoip2ccmap and set --input-path to the name of the GeoLite2-Country.mmdb file in the directory.
$ rwgeoip2ccmap \
--input-path=GeoLite2-Country_20180327/GeoLite2-Country.mmdb \
--output-path=country_codes.pmap
Download the CSV version of the MaxMind GeoIP Legacy Country database for IPv4, GeoIPCountryCSV.zip. Running unzip -l on the zip file should show a single file, GeoIPCountryWhois.csv.) To expand this file, use the unzip(1) utility.
$ unzip GeoIPCountryCSV.zip
Archive: GeoIPCountryCSV.zip
inflating: GeoIPCountryWhois.csv
Create the country_codes.pmap file by running
$ rwgeoip2ccmap --input-path=GeoIPCountryWhois.csv \
--output-path=country_codes.pmap
You may avoid creating the GeoIPCountryWhois.csv file by using the -p option of unzip to pass the output of unzip directly to rwgeoip2ccmap:
$ unzip -p GeoIPCountryCSV.zip \
| rwgeoip2ccmap --mode=ipv4 --output-path=country_codes.pmap
If you download the IPv6 version of the MaxMind GeoIP Legacy Country database, use the following command to create the country_codes.pmap file:
$ gzip -d -c GeoIPv6.csv.gz \
| rwgeoip2ccmap --mode=ipv6 > country_codes.pmap
Since the GeoIPv6.csv.gz file only contains IPv6 addresses, the resulting country_codes.pmap file will display the unknown value (--
) for any IPv4 address. See the next example for a solution.
To create a country_codes.pmap mapping file that supports both IPv4 and IPv6 addresses, download both of the Legacy CSV files (GeoIPv6.csv.gz and GeoIPCountryCSV.zip) from MaxMind.
You need to uncompress both files and feed the result as a single stream to the standard input of rwgeoip2ccmap. This can be done in a few commands:
$ gzip -d GeoIPv6.csv.gz
$ unzip GeoIPCountryCSV.zip
$ cat GeoIPv6.csv GeoIPCountryWhois.csv \
| rwgeoip2ccmap --mode=ipv6 > country_codes.pmap
Alternatively, if your shell supports it, you may be able to use a subshell to avoid having to store the uncompressed data:
$ ( gzip -d -c GeoIPv6.csv.gz ; unzip -p GeoIPCountryCSV.zip ) \
| rwgeoip2ccmap --mode=ipv6 > country_codes.pmap
To print the contents of a file that rwgeoip2ccmap creates, use the rwpmapcat(1) command, and specify the --country-codes switch:
$ rwpmapcat --country-codes=country_codes.pmap | head -5
ipBlock|value|
0.0.0.0/7| --|
2.0.0.0/14| --|
2.4.0.0/15| --|
2.6.0.0/17| --|
To reduce the number of lines in the output by combining CIDR blocks into IP ranges, use the --no-cidr-blocks switch:
$ rwpmapcat --country-codes=country_codes.pmap --no-cidr-blocks \
| head -5
startIP| endIP|value|
0.0.0.0| 2.6.190.55| --|
2.6.190.56| 2.6.190.63| gb|
2.6.190.64| 2.255.255.255| --|
3.0.0.0| 4.17.135.31| us|
To skip IP blocks that are unassigned and have the label --
, use the --ignore-label switch:
$ rwpmapcat --country-codes=country_codes.pmap --ignore-label=-- \
| head -5
ipBlock|value|
2.6.190.56/29| gb|
3.0.0.0/8| us|
4.0.0.0/12| us|
4.16.0.0/16| us|
To print the contents of the default country code prefix map, specify --country-codes without an argument:
$ export SILK_COUNTRY_CODES=country_codes.pmap
$ rwpmapcat --country-codes --ignore-label=-- | head -5
ipBlock|value|
2.6.190.56/29| gb|
3.0.0.0/8| us|
4.0.0.0/12| us|
4.16.0.0/16| us|
If you print the output of rwgeoip2ccmap without using the --country-codes switch, the numerical values are not decoded to characters and the output resembles the following:
$ rwpmapcat --no-cidr-blocks country_codes.pmap | head -5
startIP| endIP| value|
0.0.0.0| 2.6.190.55| 11565|
2.6.190.56| 2.6.190.63| 26466|
2.6.190.64| 2.255.255.255| 11565|
3.0.0.0| 4.17.135.31| 30067|
Use rwpmaplookup(1) to get the country code for specific IP address(es). Use the --no-files switch when specific the IP addresses on the command line; otherwise rwpmaplookup treats its arguments as text files containing IP addresses. The --country-code switch is required for the prefix map's data to be interpreted correctly. Give an argument to the switch for a specific file, or no argument to use the default country code prefix map.
$ rwpmaplookup --country-codes=country_codes.pmap --no-files \
3.4.5.6 4.5.6.7
key|value|
3.4.5.6| us|
4.5.6.7| us|
$ export SILK_COUNTRY_CODES=country_codes.pmap
$ cat ips.txt
3.4.5.6
4.5.6.7
$ rwpmaplookup --country-codes ips.txt
key|value|
3.4.5.6| us|
4.5.6.7| us|
The SiLK tools support using only a single country code mapping file. There may be occasions where you want to use multiple country code mapping files; for example, to see changes in an IP block's country over time, or to build separate files for each of GeoIP2 fields (location, registered, represented). One way to do this is loop through the files setting the SILK_COUNTRY_CODES environment variable to each filename and running the SiLK commands. An alternative approach is to convert the country code mapping files to ordinary prefix map files and leverage the SiLK tools' support for using multiple prefix map files in a single command.
To convert a country-code prefix map to an ordinary prefix map, use rwpmapcat(1) to print the contents of the country code prefix map file as text, and then use rwpmapbuild(1) to convert the text to an ordinary prefix map.
First, create a text file where you define a name for this prefix map, specify the mode (as either ipv4
or ipv6
), and specify the default
value to be --
:
$ cat /tmp/mymap.txt
map-name cc-old
mode ipv4
default --
$
Append the output of rwpmapcat to this file, using the space character as the delimiter.
$ rwpmapcat --no-title --delimited=' ' --ignore-label=-- \
--country-codes=country_codes.pmap \
>> /tmp/mymap.txt
$ head -5 /tmp/mymap.txt
map-name cc-old
mode ipv4
default --
2.6.190.56/29 gb
3.0.0.0/8 us
Use rwpmapbuild to create the prefix map and save it as cc-old.pmap:
$ rwpmapbuild --input-path=/tmp/mymap.txt --output-path=cc-old.pmap
Use rwfileinfo(1) to check the map-name for the prefix map file.
$ rwfileinfo --fields=prefix-map cc-old.pmap
cc-old.pmap:
prefix-map v1: cc-old
Use rwpmapcat to view its contents:
$ rwpmapcat --ignore-label=-- --no-cidr-blocks cc-old.pmap | head -5
startIP| endIP|label|
2.6.190.56| 2.6.190.63| gb|
3.0.0.0| 4.17.135.31| us|
4.17.135.32| 4.17.135.63| ca|
4.17.135.64| 4.17.142.255| us|
You can use the --pmap-file switch of various SiLK tools to load and use the cc-old.pmap prefix map file (see pmapfilter(3) for usage).
For example, suppose you have the file data.rw of SiLK Flow data:
$ rwcut --fields=sip,dip --ipv6-policy=ignore data.rw
sIP| dIP|
3.4.5.6| 4.5.6.7|
4.5.6.7| 3.4.5.6|
To map the source IP addresses in the file data.rw using the prefix map file (the src-cc-old
field) and a country code file (the scc
field) with rwcut(1):
$ export SILK_COUNTRY_CODES=country_codes.pmap
$ rwcut --pmap-file cc-old.pmap --ipv6-policy=ignore \
--fields=sip,src-cc-old,scc data.rw
sIP|src-cc-old|scc|
3.4.5.6| us| us|
4.5.6.7| us| us|
The SiLK tools normally refuse to overwrite existing files. Setting SILK_CLOBBER to a non-empty value removes this restriction.
ccfilter(3), rwpmaplookup(1), rwfilter(1), rwcut(1), rwsort(1), rwstats(1), rwuniq(1), rwgroup(1), rwpmapbuild(1), rwfileinfo(1), pmapfilter(3), silk(7), gzip(1), zip(1), unzip(1), https://dev.maxmind.com/geoip/geoip2/geolite2/, https://dev.maxmind.com/geoip/legacy/geolite/, https://www.iso.org/iso-3166-country-codes.html, https://en.wikipedia.org/wiki/ISO_3166-1_alpha-2
Support for GeoIP2 and GeoLite2 input files were added in SiLK 3.17.0.
Support for the binary form of the GeoIP Legacy format was removed in SiLK 3.12.0 and restored in SiLK 3.12.2.
MaxMind, GeoIP, and related trademarks are the trademarks of MaxMind, Inc.