plink_help.txt

PLINK v1.90b6.21 64-bit (19 Oct 2020)          www.cog-genomics.org/plink/1.9/
(C) 2005-2020 Shaun Purcell, Christopher Chang   GNU General Public License v3

In the command line flag definitions that follow,
  * <angle brackets> denote a required parameter, where the text between the
    angle brackets describes its nature.
  * ['square brackets + single-quotes'] denotes an optional modifier.  Use the
    EXACT text in the quotes.
  * [{bar|separated|braced|bracketed|values}] denotes a collection of mutually
    exclusive optional modifiers (again, the exact text must be used).  When
    there are no outer square brackets, one of the choices must be selected.
  * ['quoted_text='<description of value>] denotes an optional modifier that
    must begin with the quoted text, and be followed by a value with no
    whitespace in between.  '|' may also be used here to indicate mutually
    exclusive options.
  * [square brackets without quotes or braces] denote an optional parameter,
    where the text between the brackets describes its nature.
  * An ellipsis (...) indicates that you may enter multiple parameters of the
    specified type.

  plink <input flag(s)...> [command flag(s)...] [other flag(s)...]
  plink --help [flag name(s)...]

Most PLINK runs require exactly one main input fileset.  The following flags
are available for defining its form and location:

  --bfile [prefix] : Specify .bed + .bim + .fam prefix (default 'plink').
  --bed <filename> : Specify full name of .bed file.
  --bim <filename> : Specify full name of .bim file.
  --fam <filename> : Specify full name of .fam file.

  --keep-autoconv  : With --file/--tfile/--lfile/--vcf/--bcf/--data/--23file,
                     don't delete autogenerated binary fileset at end of run.

  --file [prefix]  : Specify .ped + .map filename prefix (default 'plink').
  --ped <filename> : Specify full name of .ped file.
  --map <filename> : Specify full name of .map file.

  --no-fid         : .fam/.ped file does not contain column 1 (family ID).
  --no-parents     : .fam/.ped file does not contain columns 3-4 (parents).
  --no-sex         : .fam/.ped file does not contain column 5 (sex).
  --no-pheno       : .fam/.ped file does not contain column 6 (phenotype).

  --tfile [prefix] : Specify .tped + .tfam filename prefix (default 'plink').
  --tped <fname>   : Specify full name of .tped file.
  --tfam <fname>   : Specify full name of .tfam file.

  --lfile [prefix] : Specify .lgen + .map + .fam (long-format fileset) prefix.
  --lgen <fname>   : Specify full name of .lgen file.
  --reference <fn> : Specify default allele file accompanying .lgen input.
  --allele-count   : When used with --lfile/--lgen + --reference, specifies
                     that the .lgen file contains reference allele counts.

  --vcf <filename> : Specify full name of .vcf or .vcf.gz file.
  --bcf <filename> : Specify full name of BCF2 file.

  --data [prefix]  : Specify Oxford .gen + .sample prefix (default 'plink').
  --gen <filename> : Specify full name of .gen or .gen.gz file.
  --bgen <f> ['snpid-chr'] : Specify full name of .bgen file.
  --sample <fname> : Specify full name of .sample file.

  --23file <fname> [FID] [IID] [sex] [pheno] [pat. ID] [mat. ID] :
    Specify 23andMe input file.

  --grm-gz [prfx]  : Specify .grm.gz + .grm.id (GCTA rel. matrix) prefix.
  --grm-bin [prfx] : Specify .grm.bin + .grm.N.bin + .grm.id (GCTA triangular
                     binary relationship matrix) filename prefix.

  --dummy <sample ct> <SNP ct> [missing geno freq] [missing pheno freq]
          [{acgt | 1234 | 12}] ['scalar-pheno']
    This generates a fake input dataset with the specified number of samples
    and SNPs.  By default, the missing genotype and phenotype frequencies are
    zero, and genotypes are As and Bs (change the latter with
    'acgt'/'1234'/'12').  The 'scalar-pheno' modifier causes a normally
    distributed scalar phenotype to be generated instead of a binary one.

  --simulate <simulation parameter file> [{tags | haps}] [{acgt | 1234 | 12}]
  --simulate-qt <sim. parameter file> [{tags | haps}] [{acgt | 1234 | 12}]
    --simulate generates a fake input dataset with disease-associated SNPs,
    while --simulate-qt generates a dataset with quantitative trait loci.

Output files have names of the form 'plink.<extension>' by default.  You can
change the 'plink' prefix with

  --out <prefix>   : Specify prefix for output files.

Most runs also require at least one of the following commands:

  --make-bed
    Create a new binary fileset.  Unlike the automatic text-to-binary
    converters (which only heed chromosome filters), this supports all of
    PLINK's filtering flags.
  --make-just-bim
  --make-just-fam
    Variants of --make-bed which only write a new .bim or .fam file.  Can be
    used with only .bim/.fam input.
    USE THESE CAUTIOUSLY.  It is very easy to desynchronize your binary
    genotype data and your .bim/.fam indexes if you use these commands
    improperly.  If you have any doubt, stick with --make-bed.

  --recode <output format> [{01 | 12}] [{tab | tabx | spacex | bgz | gen-gz}]
           ['include-alt'] ['omit-nonmale-y']
    Create a new text fileset with all filters applied.  The following output
    formats are supported:
    * '23': 23andMe 4-column format.  This can only be used on a single
      sample's data (--keep may be handy), and does not support multicharacter
      allele codes.
    * 'A': Sample-major additive (0/1/2) coding, suitable for loading from R.
      If you need uncounted alleles to be named in the header line, add the
      'include-alt' modifier.
    * 'AD': Sample-major additive (0/1/2) + dominant (het=1/hom=0) coding.
      Also supports 'include-alt'.
    * 'A-transpose': Variant-major 0/1/2.
    * 'beagle': Unphased per-autosome .dat and .map files, readable by early
      BEAGLE versions.
    * 'beagle-nomap': Single .beagle.dat file.
    * 'bimbam': Regular BIMBAM format.
    * 'bimbam-1chr': BIMBAM format, with a two-column .pos.txt file.  Does not
      support multiple chromosomes.
    * 'fastphase': Per-chromosome fastPHASE files, with
      .chr-<chr #>.recode.phase.inp filename extensions.
    * 'fastphase-1chr': Single .recode.phase.inp file.  Does not support
      multiple chromosomes.
    * 'HV': Per-chromosome Haploview files, with .chr-<chr #>{.ped,.info}
      filename extensions.
    * 'HV-1chr': Single Haploview .ped + .info file pair.  Does not support
      multiple chromosomes.
    * 'lgen': PLINK 1 long-format (.lgen + .fam + .map), loadable with --lfile.
    * 'lgen-ref': .lgen + .fam + .map + .ref, loadable with --lfile +
       --reference.
    * 'list': Single genotype-based list, up to 4 lines per variant.  To omit
      nonmale genotypes on the Y chromosome, add the 'omit-nonmale-y' modifier.
    * 'rlist': .rlist + .fam + .map fileset, where the .rlist file is a
      genotype-based list which omits the most common genotype for each
      variant.  Also supports 'omit-nonmale-y'.
    * 'oxford': Oxford-format .gen + .sample.  With the 'gen-gz' modifier, the
      .gen file is gzipped.
    * 'ped': PLINK 1 sample-major (.ped + .map), loadable with --file.
    * 'compound-genotypes': Same as 'ped', except that the space between each
      pair of same-variant allele codes is removed.
    * 'structure': Structure-format.
    * 'transpose': PLINK 1 variant-major (.tped + .tfam), loadable with
      --tfile.
    * 'vcf', 'vcf-fid', 'vcf-iid': VCFv4.2.  'vcf-fid' and 'vcf-iid' cause
      family IDs or within-family IDs respectively to be used for the sample
      IDs in the last header row, while 'vcf' merges both IDs and puts an
      underscore between them.  If the 'bgz' modifier is added, the VCF file is
      block-gzipped.
      The A2 allele is saved as the reference and normally flagged as not based
      on a real reference genome (INFO:PR).  When it is important for reference
      alleles to be correct, you'll also want to include --a2-allele and
      --real-ref-alleles in your command.
    In addition,
    * The '12' modifier causes A1 (usually minor) alleles to be coded as '1'
      and A2 alleles to be coded as '2', while '01' maps A1 -> 0 and A2 -> 1.
    * The 'tab' modifier makes the output mostly tab-delimited instead of
      mostly space-delimited.  'tabx' and 'spacex' force all tabs and all
      spaces, respectively.

  --flip-scan ['verbose']
    (alias: --flipscan)
    LD-based scan for case/control strand inconsistency.

  --write-covar
    If a --covar file is loaded, --make-bed/--make-just-fam and --recode
    automatically generate an updated version (with all filters applied).
    However, if you do not wish to simultaneously generate a new genotype file,
    you can use --write-covar to just produce a pruned covariate file.

  --write-cluster ['omit-unassigned']
    If clusters are specified with --within/--family, this generates a new
    cluster file (with all filters applied).  The 'omit-unassigned' modifier
    causes unclustered samples to be omitted from the file; otherwise their
    cluster is 'NA'.

  --write-set
  --set-table
    If sets have been defined, --write-set dumps 'END'-terminated set
    membership lists to <output prefix>.set, while --set-table writes a
    variant-by-set membership table to <output prefix>.set.table.

  --merge <.ped filename> <.map filename>
  --merge <text fileset prefix>
  --bmerge <.bed filename> <.bim filename> <.fam filename>
  --bmerge <binary fileset prefix>
    Merge the given fileset with the initially loaded fileset, writing the
    result to <output prefix>.bed + .bim + .fam.  (It is no longer necessary to
    simultaneously specify --make-bed.)
  --merge-list <filename>
    Merge all filesets named in the text file with the reference fileset, if
    one was specified.  (However, this can also be used *without* a reference;
    in that case, the newly created fileset is then treated as the reference by
    most other PLINK operations.)  The text file is interpreted as follows:
    * If a line contains only one name, it is assumed to be the prefix for a
      binary fileset.
    * If a line contains exactly two names, they are assumed to be the full
      filenames for a text fileset (.ped first, then .map).
    * If a line contains exactly three names, they are assumed to be the full
      filenames for a binary fileset (.bed, then .bim, then .fam).

  --write-snplist
  --list-23-indels
    --write-snplist writes a .snplist file listing the names of all variants
    which pass the filters and inclusion thresholds you've specified, while
    --list-23-indels writes the subset with 23andMe-style indel calls (D/I
    allele codes).

  --list-duplicate-vars ['require-same-ref'] ['ids-only'] ['suppress-first']
    --list-duplicate-vars writes a .dupvar file describing all groups of
    variants with matching positions and allele codes.
    * By default, A1/A2 allele assignments are ignored; use 'require-same-ref'
      to override this.
    * Normally, the report contains position and allele codes.  To remove them
      (and produce a file directly usable with e.g. --extract/--exclude), use
      'ids-only'.  Note that this command will fail in 'ids-only' mode if any
      of the reported IDs are not unique.
    * 'suppress-first' causes the first variant ID in each group to be omitted
      from the report.

  --freq [{counts | case-control}] ['gz']
  --freqx ['gz']
    --freq generates a basic allele frequency (or count, if the 'counts'
    modifier is present) report.  This can be combined with --within/--family
    to produce a cluster-stratified allele frequency/count report instead, or
    the 'case-control' modifier to report case and control allele frequencies
    separately.
    --freqx generates a more detailed genotype count report, designed for use
    with --read-freq.

  --missing ['gz']
    Generate sample- and variant-based missing data reports.  If clusters are
    defined, the variant-based report is cluster-stratified.  'gz' causes the
    output files to be gzipped.
    Unlike most other commands, this doesn't treat het. haploids as missing.

  --test-mishap
    Check for association between missing calls and flanking haplotypes.

  --hardy ['midp'] ['gz']
    Generate a Hardy-Weinberg exact test p-value report.  (This does NOT
    simultaneously filter on the p-value any more; use --hwe for that.)  With
    the 'midp' modifier, the test applies the mid-p adjustment described in
    Graffelman J, Moreno V (2013) The mid p-value in exact tests for
    Hardy-Weinberg Equilibrium.

  --mendel ['summaries-only']
    Generate a Mendel error report.  The 'summaries-only' modifier causes the
    .mendel file (listing every single error) to be skipped.

  --het ['small-sample'] ['gz']
  --ibc
    Estimate inbreeding coefficients.  --het reports method-of-moments
    estimates, while --ibc calculates all three values described in Yang J, Lee
    SH, Goddard ME and Visscher PM (2011) GCTA: A Tool for Genome-wide Complex
    Trait Analysis.  (That paper also describes the relationship matrix
    computation we reimplement.)
    * These functions require decent MAF estimates.  If there are very few
      samples in your immediate fileset, --read-freq is practically mandatory
      since imputed MAFs are wildly inaccurate in that case.
    * They also assume the marker set is in approximate linkage equilibrium.
    * By default, --het omits the n/(n-1) multiplier in Nei's expected
      homozygosity formula.  The 'small-sample' modifier causes it to be
      included, while forcing --het to use MAFs imputed from founders in the
      immediate dataset.

  --check-sex [female max F] [male min F]
  --check-sex ycount [female max F] [male min F] [female max Y obs]
                     [male min Y obs]
  --check-sex y-only [female max Y obs] [male min Y obs]
  --impute-sex [female max F] [male min F]
  --impute-sex ycount [female max F] [male min F] [female max Y obs]
                      [male min Y obs]
  --impute-sex y-only [female max Y obs] [male min Y obs]
    --check-sex normally compares sex assignments in the input dataset with
    those imputed from X chromosome inbreeding coefficients.
    * Make sure that the X chromosome pseudo-autosomal region has been split
      off (with e.g. --split-x) before using this.
    * You also need decent MAF estimates (so, with very few samples in your
      immediate fileset, use --read-freq), and your marker set should be in
      approximate linkage equilibrium.
    * By default, F estimates smaller than 0.2 yield female calls, and values
      larger than 0.8 yield male calls.  If you pass numeric parameter(s) to
      --check-sex, the first two control these thresholds.
    There are now two modes which consider Y chromosome data.
    * In 'ycount' mode, gender is still imputed from the X chromosome, but
      female calls are downgraded to ambiguous whenever more than 0 nonmissing
      Y genotypes are present, and male calls are downgraded when fewer than 0
      are present.  (Note that these are counts, not rates.)  These thresholds
      are controllable with --check-sex ycount's optional 3rd and 4th numeric
      parameters.
    * In 'y-only' mode, gender is imputed from nonmissing Y genotype counts.
      The male minimum threshold defaults to 1 instead of zero in this case.
    --impute-sex changes sex assignments to the imputed values, and is
    otherwise identical to --check-sex.  It must be used with
    --make-bed/--recode/--write-covar.

  --fst ['case-control']
    (alias: --Fst)
    Estimate Wright's Fst for each autosomal diploid variant using the method
    introduced in Weir BS, Cockerham CC (1984) Estimating F-statistics for the
    analysis of population structure, given a set of subpopulations defined via
    --within.  Raw and weighted global means are also reported.
    * If you're interested in the global means, it is usually best to perform
      this calculation on a marker set in approximate linkage equilibrium.
    * If you have only two subpopulations, you can represent them with
      case/control status and use the 'case-control' modifier.

  --indep <window size>['kb'] <step size (variant ct)> <VIF threshold>
  --indep-pairwise <window size>['kb'] <step size (variant ct)> <r^2 threshold>
  --indep-pairphase <window size>['kb'] <step size (variant ct)> <r^2 thresh>
    Generate a list of markers in approximate linkage equilibrium.  With the
    'kb' modifier, the window size is in kilobase instead of variant count
    units.  (Pre-'kb' space is optional, i.e. "--indep-pairwise 500 kb 5 0.5"
    and "--indep-pairwise 500kb 5 0.5" have the same effect.)
    Note that you need to rerun PLINK using --extract or --exclude on the
    .prune.in/.prune.out file to apply the list to another computation.

  --r [{square | square0 | triangle | inter-chr}] [{gz | bin | bin4}]
      ['spaces'] ['in-phase'] [{d | dprime | dprime-signed}] ['with-freqs']
      ['yes-really']
  --r2 [{square | square0 | triangle | inter-chr}] [{gz | bin | bin4}]
       ['spaces'] ['in-phase'] [{d | dprime | dprime-signed}] ['with-freqs']
       ['yes-really']
    LD statistic reports.  --r yields raw inter-variant correlations, while
    --r2 reports their squares.  You can request results for all pairs in
    matrix format (if you specify 'bin' or one of the shape modifiers), all
    pairs in table format ('inter-chr'), or a limited window in table format
    (default).
    * The 'gz' modifier causes the output text file to be gzipped.
    * 'bin' causes the output matrix to be written in double-precision binary
      format, while 'bin4' specifics single-precision binary.  The matrix is
      square if no shape is explicitly specified.
    * By default, text matrices are tab-delimited; 'spaces' switches this.
    * 'in-phase' adds a column with in-phase allele pairs to table-formatted
      reports.  (This cannot be used with very long allele codes.)
    * 'dprime' adds the absolute value of Lewontin's D-prime statistic to
      table-formatted reports, and forces both r/r^2 and D-prime to be based on
      the maximum likelihood solution to the cubic equation discussed in Gaunt
      T, Rodriguez S, Day I (2007) Cubic exact solutions for the estimation of
      pairwise haplotype frequencies.
      'dprime-signed' keeps the sign, while 'd' skips division by D_{max}.
    * 'with-freqs' adds MAF columns to table-formatted reports.
    * Since the resulting file can easily be huge, you're required to add the
      'yes-really' modifier when requesting an unfiltered, non-distributed all
      pairs computation on more than 400k variants.
    * These computations can be subdivided with --parallel (even when the
      'square' modifier is active).
  --ld <variant ID> <variant ID> ['hwe-midp']
    This displays haplotype frequencies, r^2, and D' for a single pair of
    variants.  When there are multiple biologically possible solutions to the
    haplotype frequency cubic equation, all are displayed (instead of just the
    maximum likelihood solution identified by --r/--r2), along with HWE exact
    test statistics.

  --show-tags <filename>
  --show-tags all
    * If a file is specified, list all variants which tag at least one variant
      named in the file.  (This will normally be a superset of the original
      list, since a variant is considered to tag itself here.)
    * If 'all' mode is specified, for each variant, each *other* variant which
      tags it is reported.

  --blocks ['no-pheno-req'] ['no-small-max-span']
    Estimate haplotype blocks, via Haploview's interpretation of the block
    definition suggested by Gabriel S et al. (2002) The Structure of Haplotype
    Blocks in the Human Genome.
    * Normally, samples with missing phenotypes are not considered by this
      computation; the 'no-pheno-req' modifier lifts this restriction.
    * Normally, size-2 blocks may not span more than 20kb, and size-3 blocks
      are limited to 30kb.  The 'no-small-max-span' modifier removes these
      limits.
    The .blocks file is valid input for PLINK 1.07's --hap command.  However,
    the --hap... family of flags has not been reimplemented in PLINK 1.9 due to
    poor phasing accuracy relative to other software; for now, we recommend
    using BEAGLE instead of PLINK for case/control haplotype association
    analysis.  (You can use "--recode beagle" to export data to BEAGLE 3.3.)
    We apologize for the inconvenience, and plan to develop variants of the
    --hap... flags which handle pre-phased data effectively.

  --distance [{square | square0 | triangle}] [{gz | bin | bin4}] ['ibs']
             ['1-ibs'] ['allele-ct'] ['flat-missing']
    Write a lower-triangular tab-delimited table of (weighted) genomic
    distances in allele count units to <output prefix>.dist, and a list of the
    corresponding sample IDs to <output prefix>.dist.id.  The first row of the
    .dist file contains a single <genome 1-genome 2> distance, the second row
    has the <genome 1-genome 3> and <genome 2-genome 3> distances in that
    order, etc.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * If the 'square' or 'square0' modifier is present, a square matrix is
      written instead; 'square0' fills the upper right triangle with zeroes.
    * If the 'gz' modifier is present, a compressed .dist.gz file is written
      instead of a plain text file.
    * If the 'bin' modifier is present, a binary (square) matrix of
      double-precision floating point values, suitable for loading from R, is
      instead written to <output prefix>.dist.bin.  ('bin4' specifies
      single-precision numbers instead.)  This can be combined with 'square0'
      if you still want the upper right zeroed out, or 'triangle' if you don't
      want to pad the upper right at all.
    * If the 'ibs' modifier is present, an identity-by-state matrix is written
      to <output prefix>.mibs.  '1-ibs' causes distances expressed as genomic
      proportions (i.e. 1 - IBS) to be written to <output prefix>.mdist.
      Combine with 'allele-ct' if you want to generate the usual .dist file as
      well.
    * By default, distance rescaling in the presence of missing genotype calls
      is sensitive to allele count distributions: if variant A contributes, on
      average, twice as much to other pairwise distances as variant B, a
      missing call at variant A will result in twice as large of a missingness
      correction.  To turn this off (because e.g. your missing calls are highly
      nonrandom), use the 'flat-missing' modifier.
    * The computation can be subdivided with --parallel.
  --distance-matrix
  --ibs-matrix
    These deprecated commands are equivalent to "--distance 1-ibs flat-missing
    square" and "--distance ibs flat-missing square", respectively, except that
    they generate space- instead of tab-delimited text matrices.

  --make-rel [{square | square0 | triangle}] [{gz | bin | bin4}]
             [{cov | ibc2 | ibc3}]
    Write a lower-triangular variance-standardized realized relationship matrix
    to <output prefix>.rel, and corresponding IDs to <output prefix>.rel.id.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * 'square', 'square0', 'triangle', 'gz', 'bin', and 'bin4' act as they do
      on --distance.
    * The 'cov' modifier removes the variance standardization step, causing a
      covariance matrix to be calculated instead.
    * By default, the diagonal elements in the relationship matrix are based on
      --ibc's Fhat1; use the 'ibc2' or 'ibc3' modifiers to base them on Fhat2
      or Fhat3 instead.
    * The computation can be subdivided with --parallel.
  --make-grm-gz ['no-gz'] [{cov | ibc2 | ibc3}]
  --make-grm-bin [{cov | ibc2 | ibc3}]
    --make-grm-gz writes the relationships in GCTA's original gzipped list
    format, which describes one pair per line, while --make-grm-bin writes them
    in GCTA 1.1+'s single-precision triangular binary format.  Note that these
    formats explicitly report the number of valid observations (where neither
    sample has a missing call) for each pair, which is useful input for some
    scripts.
    These computations can be subdivided with --parallel.

  --rel-cutoff [val]
    (alias: --grm-cutoff)
    Exclude one member of each pair of samples with relatedness greater than
    the given cutoff value (default 0.025).  If no later operation will cause
    the list of remaining samples to be written to disk, this will save it to
    <output prefix>.rel.id.
    Note that maximizing the remaining sample size is equivalent to the NP-hard
    maximum independent set problem, so we use a greedy algorithm instead of
    guaranteeing optimality.  (Use the --make-rel and --keep/--remove flags if
    you want to try to do better.)

  --ibs-test [permutation count]
  --groupdist [iters] [d]
    Given case/control phenotype data, these commands consider three subsets of
    the distance matrix: pairs of affected samples, affected-unaffected pairs,
    and pairs of unaffected samples.  Each of these subsets has a distribution
    of pairwise genomic distances; --ibs-test uses permutation to estimate
    p-values re: which types of pairs are most similar, while --groupdist
    focuses on the differences between the centers of these distributions and
    estimates standard errors via delete-d jackknife.

  --regress-distance [iters] [d]
    Linear regression of pairwise genomic distances on pairwise average
    phenotypes and vice versa, using delete-d jackknife for standard errors.  A
    scalar phenotype is required.
    * With less than two parameters, d is set to <number of people>^0.6 rounded
      down.  With no parameters, 100k iterations are run.
  --regress-rel [iters] [d]
    Linear regression of pairwise genomic relationships on pairwise average
    phenotypes, and vice versa.  Defaults for iters and d are the same as for
    --regress-distance.

  --genome ['gz'] ['rel-check'] ['full'] ['unbounded'] ['nudge']
    Generate an identity-by-descent report.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * The 'rel-check' modifier excludes pairs of samples with different FIDs
      from the final report.
    * 'full' adds raw pairwise comparison data to the report.
    * The P(IBD=0/1/2) estimator employed by this command sometimes yields
      numbers outside the range [0,1]; by default, these are clipped.  The
      'unbounded' modifier turns off this clipping.
    * Then, when PI_HAT^2 < P(IBD=2), 'nudge' adjusts the final P(IBD=0/1/2)
      estimates to a theoretically possible configuration.
    * The computation can be subdivided with --parallel.

  --homozyg [{group | group-verbose}] ['consensus-match'] ['extend']
            ['subtract-1-from-lengths']
  --homozyg-snp <min var count>
  --homozyg-kb <min length>
  --homozyg-density <max inverse density (kb/var)>
  --homozyg-gap <max internal gap kb length>
  --homozyg-het <max hets>
  --homozyg-window-snp <scanning window size>
  --homozyg-window-het <max hets in scanning window hit>
  --homozyg-window-missing <max missing calls in scanning window hit>
  --homozyg-window-threshold <min scanning window hit rate>
    These commands request a set of run-of-homozygosity reports, and allow you
    to customize how they are generated.
    * If you're satisfied with all the default settings described below, just
      use --homozyg with no modifiers.  Otherwise, --homozyg lets you change a
      few binary settings:
      * 'group[-verbose]' adds a report on pools of overlapping runs of
        homozygosity.  (Automatically set when --homozyg-match is present.)
      * With 'group[-verbose]', 'consensus-match' causes pairwise segmental
        matches to be called based on the variants in the pool's consensus
        segment, rather than the variants in the pairwise intersection.
      * Due to how the scanning window algorithm works, it is possible for a
        reported ROH to be adjacent to a few homozygous variants.  The 'extend'
        modifier causes them to be included in the reported ROH if that
        wouldn't cause a violation of the --homozyg-density bound.
      * By default, segment bp lengths are calculated as <end bp position> -
        <start bp position> + 1.  Therefore, reports normally differ slightly
        from PLINK 1.07, which does not add 1 at the end.  For testing
        purposes, you can use the 'subtract-1-from-lengths' modifier to apply
        the old formula.
    * By default, only runs of homozygosity containing at least 100 variants,
      and of total length >= 1000 kilobases, are noted.  You can change these
      minimums with --homozyg-snp and --homozyg-kb, respectively.
    * By default, a ROH must have at least one variant per 50 kb on average;
      change this bound with --homozyg-density.
    * By default, if two consecutive variants are more than 1000 kb apart, they
      cannot be in the same ROH; change this bound with --homozyg-gap.
    * By default, a ROH can contain an unlimited number of heterozygous calls;
      you can impose a limit with --homozyg-het.
    * By default, the scanning window contains 50 variants; change this with
      --homozyg-window-snp.
    * By default, a scanning window hit can contain at most 1 heterozygous
      call and 5 missing calls; change these limits with --homozyg-window-het
      and --homozyg-window-missing, respectively.
    * By default, for a variant to be eligible for inclusion in a ROH, the hit
      rate of all scanning windows containing the variant must be at least
      0.05; change this threshold with --homozyg-window-threshold.

  --cluster ['cc'] [{group-avg | old-tiebreaks}] ['missing'] ['only2']
    Cluster samples using a pairwise similarity statistic (normally IBS).
    * The 'cc' modifier forces every cluster to have at least one case and one
      control.
    * The 'group-avg' modifier causes clusters to be joined based on average
      instead of minimum pairwise similarity.
    * The 'missing' modifier causes clustering to be based on
      identity-by-missingness instead of identity-by-state, and writes a
      space-delimited identity-by-missingness matrix to disk.
    * The 'only2' modifier causes only a .cluster2 file (which is valid input
      for --within) to be written; otherwise 2 other files will be produced.
    * By default, IBS ties are not broken in the same manner as PLINK 1.07, so
      final cluster solutions tend to differ.  This is generally harmless.
      However, to simplify testing, you can use the 'old-tiebreaks' modifier to
      force emulation of the old algorithm.

  --pca [count] ['header'] ['tabs'] ['var-wts']
    Calculates a variance-standardized relationship matrix (use
    --make-rel/--make-grm-gz/--make-grm-bin to dump it), and extracts the top
    20 principal components.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * You can change the number of PCs by passing a numeric parameter.
    * The 'header' modifier adds a header line to the .eigenvec output file.
      (For compatibility with the GCTA flag of the same name, the default is no
      header line.)
    * The 'tabs' modifier causes the .eigenvec file(s) to be tab-delimited.
    * The 'var-wts' modifier requests an additional .eigenvec.var file with PCs
      expressed as variant weights instead of sample weights.

  --neighbour <n1> <n2>
    (alias: --neighbor)
    Report IBS distances from each sample to their n1th- to n2th-nearest
    neighbors, associated Z-scores, and the identities of those neighbors.
    Useful for outlier detection.

  --assoc ['perm' | 'mperm='<value>] ['perm-count'] [{fisher | fisher-midp}]
          ['counts'] ['set-test']
  --assoc ['perm' | 'mperm='<value>] ['perm-count'] ['qt-means'] ['lin']
          ['set-test']
  --model ['perm' | 'mperm='<value>] ['perm-count']
          [{fisher | fisher-midp | trend-only}] ['set-test']
          [{dom | rec | gen | trend}]
    Basic association analysis report.
    Given a case/control phenotype, --assoc performs a 1df chi-square allelic
    test, while --model performs 4 other tests as well (1df dominant gene
    action, 1df recessive gene action, 2df genotypic, Cochran-Armitage trend).
    * With 'fisher'/'fisher-midp', Fisher's exact test is used to generate
      p-values.  'fisher-midp' also applies Lancaster's mid-p adjustment.
    * 'perm' causes an adaptive permutation test to be performed.
    * 'mperm='<value> causes a max(T) permutation test with the specified
      number of replications to be performed.
    * 'perm-count' causes the permutation test report to include counts instead
      of frequencies.
    * 'counts' causes --assoc to report allele counts instead of frequencies.
    * 'set-test' tests the significance of variant sets.  Requires permutation;
      can be customized with --set-p/--set-r2/--set-max.
    * 'dom', 'rec', 'gen', and 'trend' force the corresponding test to be used
      as the basis for --model permutation.  (By default, the most significant
      result among the allelic, dominant, and recessive tests is used.)
    * 'trend-only' causes only the trend test to be performed.
    Given a quantitative phenotype, --assoc normally performs a Wald test.
    * In this case, the 'qt-means' modifier causes trait means and standard
      deviations stratified by genotype to be reported as well.
    * 'lin' causes the Lin statistic to be computed, and makes it the basis for
      multiple-testing corrections and permutation tests.
    Several other flags (most notably, --aperm) can be used to customize the
    permutation test.

  --mh ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
    (alias: --cmh)
  --bd ['perm' | 'perm-bd' | 'mperm='<value>] ['perm-count'] ['set-test']
  --mh2
  --homog
    Given a case/control phenotype and a set of clusters, --mh computes 2x2xK
    Cochran-Mantel-Haenszel statistics for each variant, while --bd also
    performs the Breslow-Day test for odds ratio homogeneity.  Permutation and
    variant set testing based on the CMH (default) or Breslow-Day (when
    'perm-bd' is present) statistic are supported.
    The following similar analyses are also available:
    * --mh2 swaps the roles of case/control status and cluster membership,
      performing a phenotype-stratified IxJxK Cochran-Mantel-Haenszel test on
      association between cluster assignments and genotypes.
    * --homog executes an alternative to the Breslow-Day test, based on
      partitioning of the chi-square statistic.

  --gxe [covariate index]
    Given both a quantitative phenotype and a case/control covariate loaded
    with --covar defining two groups, --gxe compares the regression coefficient
    derived from considering only members of one group to the regression
    coefficient derived from considering only members of the other.  By
    default, the first covariate in the --covar file defines the groups; use
    e.g. "--gxe 3" to base them on the third covariate instead.

  --linear ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
           [{genotypic | hethom | dominant | recessive | no-snp}]
           ['hide-covar'] [{sex | no-x-sex}] ['interaction'] ['beta']
           ['standard-beta'] ['intercept']
  --logistic ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
             [{genotypic | hethom | dominant | recessive | no-snp}]
             ['hide-covar'] [{sex | no-x-sex}] ['interaction'] ['beta']
             ['intercept']
    Multi-covariate association analysis on a quantitative (--linear) or
    case/control (--logistic) phenotype.  Normally used with --covar.
    * 'perm' normally causes an adaptive permutation test to be performed on
      the main effect, while 'mperm='<value> starts a max(T) permutation test.
    * 'perm-count' causes the permutation test report to include counts instead
      of frequencies.
    * 'set-test' tests the significance of variant sets.  Requires permutation;
      can be customized with --set-p/--set-r2/--set-max.
    * The 'genotypic' modifier adds an additive effect/dominance deviation 2df
      joint test (0/1/2 and 0/1/0 coding), while 'hethom' uses 0/0/1 and 0/1/0
      coding instead.  If permutation is also requested, these modifiers cause
      permutation to be based on the joint test.
    * 'dominant' and 'recessive' specify a model assuming full dominance or
      recessiveness, respectively, for the A1 allele.
    * 'no-snp' causes regression to be performed only on the phenotype and the
      covariates, without reference to genomic data.  If permutation is also
      requested, results are reported for all covariates.
    * 'hide-covar' removes covariate-specific lines from the report.
    * By default, sex (male = 1, female = 0) is automatically added as a
      covariate on X chromosome variants, and nowhere else.  The 'sex' modifier
      causes it to be added everywhere, while 'no-x-sex' excludes it.
    * 'interaction' adds genotype x covariate interactions to the model.  This
      cannot be used with the usual permutation tests; use --tests to define
      the permutation test statistic instead.
    * 'intercept' causes intercepts to be included in the main report.
    * For logistic regressions, the 'beta' modifier causes regression
      coefficients instead of odds ratios to be reported.
    * With --linear, the 'standard-beta' modifier standardizes the phenotype
      and all predictors to zero mean and unit variance before regression.

  --dosage <allele dosage file> ['noheader'] ['skip0='<i>] ['skip1='<j>]
           ['skip2='<k>] ['dose1'] ['format='<m>] ['Zout']
           [{occur | standard-beta}] ['sex'] ['case-control-freqs']
  --dosage <list file> list [{sepheader | noheader}] ['skip0='<i>]
           ['skip1='<j>] ['skip2='<k>] ['dose1'] ['format='<m>] ['Zout']
           [{occur | standard-beta}] ['sex'] ['case-control-freqs']
  --write-dosage
    Process (possibly gzipped) text files with variant-major allelic dosage
    data.  This cannot be used with a regular input fileset; instead, you must
    *only* specify a .fam and possibly a .map file, and you can't specify any
    other commands.
    * PLINK 2.0 will have first-class support for genotype probabilities.  An
      equivalent data import flag will be provided then, and --dosage will be
      retired.
    * By default, --dosage assumes that only one allelic dosage file should be
      loaded.  To specify multiple files,
      1. create a master list with one entry per line.  There are normally two
         supported formats for this list: just a filename per line, or variant
         batch numbers in the first column and filenames in the second.
      2. Provide the name of that list as the first --dosage parameter.
      3. Add the 'list' modifier.
    * By default, --dosage assumes the allelic dosage file(s) contain a header
      line, which has 'SNP' in column i+1, 'A1' in column i+j+2, 'A2' in column
      i+j+3, and sample FID/IIDs starting from column i+j+k+4.  (i/j/k are
      normally zero, but can be changed with 'skip0', 'skip1', and 'skip2'
      respectively.)  If such a header line is not present,
      * when all samples appear in the same order as they do in the .fam file,
        you can use the 'noheader' modifier.
      * Otherwise, use the 'sepheader' modifier, and append sample ID filenames
        to your 'list' file entries.
    * The 'format=' modifier lets you specify the number of values used to
      represent each dosage.  'format=1' normally indicates a single 0..2 A1
      expected count; 'dose1' modifies this to a 0..1 frequency.  'format=2'
      (the default) indicates a 0..1 homozygous A1 likelihood followed by a
      0..1 het likelihood, while 'format=3' indicates 0..1 hom A1, 0..1 het,
      0..1 hom A2.
    * 'Zout' causes the output file to be gzipped.
    * Normally, an association analysis is performed.  'standard-beta' and
      'sex' behave as they are supposed to with --linear/--logistic.
      'case-control-freqs' causes case and control allele frequencies to be
      reported separately.
    * There are three alternate modes which cause the association analysis to
      be skipped.
      * 'occur' requests a simple variant occurrence report.
      * --write-dosage causes a simple merged file matching the 'format'
        specification (not including 'dose1') to be generated.
      * --score applies a linear scoring system to the dosages.

  --lasso <h2 estimate> [min lambda] ['report-zeroes']
    Estimate variant effect sizes via LASSO regression.  You must provide an
    additive heritability estimate to calibrate the regression.
    Note that this method may require a very large sample size (e.g. hundreds
    of thousands) to be effective on complex polygenic traits.

  --test-missing ['perm' | 'mperm='<value>] ['perm-count'] ['midp']
    Check for association between missingness and case/control status, using
    Fisher's exact test.  (Het. haploids are treated as missing.)
    The 'midp' modifier causes Lancaster's mid-p adjustment to be applied.

  --make-perm-pheno <ct>
    Generate phenotype permutations and write them to disk, without invoking an
    association test.

  --tdt [{exact | exact-midp | poo}] ['perm' | 'mperm='<value>] ['perm-count']
        [{parentdt1 | parentdt2 | pat | mat}] ['set-test']
    Report transmission disequilibrium test statistics, given case/control
    phenotypes and pedigree information.
    * A Mendel error check is performed before the main tests; offending
      genotypes are treated as missing by this analysis.
    * By default, the basic TDT p-value is based on a chi-square test unless
      you request the exact binomial test with 'exact' or 'exact-midp'.
    * 'perm'/'mperm=' requests a family-based adaptive or max(T) permutation
      test.  By default, the permutation test statistic is the basic TDT
      p-value; 'parentdt1'/'parentdt2' cause parenTDT or combined test
      p-values, respectively, to be considered instead.
    * 'set-test' tests the significance of variant sets.  This cannot be used
      with exact tests for now.
    The 'poo' modifier causes a parent-of-origin analysis to be performed
    instead, with transmissions from heterozygous fathers and heterozygous
    mothers considered separately.
    * The parent-of-origin analysis does not currently support exact tests.
    * By default, the permutation test statistic is the absolute
      parent-of-origin test Z score; 'pat'/'mat' cause paternal or maternal TDT
      chi-square statistics, respectively, to be considered instead.

  --qfam ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-parents ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-between ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-total ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
    QFAM family-based association test for quantitative traits.
    * A Mendel error check is performed before the main tests; offending
      genotypes are treated as missing by this analysis.
    * This procedure requires permutation.  'perm' and 'perm-count' have the
      usual meanings.  However, 'mperm='<value> just specifies a fixed number
      of permutations; the method does not support a proper max(T) test.
    * The 'emp-se' modifier adds BETA and EMP_SE (empirical standard error for
      beta) fields to the .perm output file.

  --annotate <PLINK report> ['attrib='<file>] ['ranges='<file>]
             ['filter='<file>] ['snps='<file>] [{NA | prune}] ['block']
             ['subset='<file>] ['minimal'] ['distance']
    Add annotations to a variant-based PLINK report.  This requires an
    annotation source:
    * 'attrib='<file> specifies a (possibly gzipped) attribute file.
    * 'ranges='<file> specifies a gene/range list file.
    (Both source types can be specified simultaneously.)  The following options
    are also supported:
    * 'filter='<file> causes only variants within one of the ranges in the file
      to be included in the new report.
    * 'snps='<file> causes only variants named in the file to be included in
      the new report.
    * The 'NA' modifier causes unannotated variants to have 'NA' instead of '.'
      in the new report's ANNOT column, while the 'prune' modifier excludes
      them entirely.
    * The 'block' modifier replaces the single ANNOT column with a 0/1-coded
      column for each possible annotation.
    * With 'ranges',
      * 'subset='<file> causes only intervals named in the subset file to be
        loaded from the ranges file.
      * interval annotations normally come with a parenthesized signed distance
        to the interval boundary (0 if the variant is located inside the
        interval; this is always true without --border).  They can be excluded
        with the 'minimal' modifier.
      * the 'distance' modifier adds 'DIST' and 'SGN' columns describing signed
        distance to the nearest interval.
    * When --pfilter is present, high p-values are filtered out.

  --clump <PLINK report filename(s)...>
    Process association analysis report(s) with 'SNP' and p-value columns,
    organizing results by LD-based clumps.  Multiple filenames can be separated
    by spaces or commas.

  --gene-report <PLINK report> <gene range file>
    Generate a gene-based report from a variant-based report.
    * When --pfilter is present, high p-values are filtered out.
    * When --extract (without 'range') is present, only variants named in the
      --extract file are considered.

  --meta-analysis <PLINK report filenames...>
  --meta-analysis <PLINK report filenames...> + [{logscale | qt}]
                  [{no-map | no-allele}] ['study'] ['report-all']
                  ['weighted-z']
    Perform a meta-analysis on several variant-based reports with 'SNP' and
    'SE' fields.
    * Normally, an 'OR' odds ratio field must also be present in each input
      file.  With 'logscale', 'BETA' log-odds values/regression coefficients
      are expected instead, but the generated report will still contain odds
      ratio estimates.  With 'qt', both input and output values are regression
      betas.
    * 'CHR', 'BP', and 'A1' fields are also normally required.  'no-map' causes
      them to all be ignored, while 'no-allele' causes just 'A1' to be ignored.
    * If 'A2' fields are present, and neither 'no-map' nor 'no-allele' was
      specified, A1/A2 allele flips are handled properly.  Otherwise, A1
      mismatches are thrown out.
    * 'study' causes study-specific effect estimates to be collated in the
      meta-analysis report.
    * 'report-all' causes variants present in only a single input file to be
      included in the meta-analysis report.
    * 'weighted-z' requests weighted Z-score-based p-values (as computed by the
      Abecasis Lab's METAL software) in addition to the usual inverse
      variance-based analysis.  This requires P and effective sample size
      fields.
    * When --extract (without 'range') is present, only variants named in the
      --extract file are considered.
    * Unless 'no-map' is specified, chromosome filters are also respected.

  --fast-epistasis [{boost | joint-effects | no-ueki}] ['case-only']
                   [{set-by-set | set-by-all}] ['nop']
  --epistasis [{set-by-set | set-by-all}]
    Scan for epistatic interactions.  --fast-epistasis inspects 3x3 joint
    genotype count tables and only applies to case/control phenotypes, while
    --epistasis performs linear or logistic regression.
    * By default, --fast-epistasis uses the PLINK 1.07 allele-based test.  Two
      newer tests are now supported: 'boost' invokes the likelihood ratio test
      introduced by Wan X et al. (2010) BOOST: A Fast Approach to Detecting
      Gene-Gene Interactions in Genome-wide Case-Control Studies, while
      'joint-effects' applies the joint effects test introduced in Ueki M,
      Cordell HJ (2012) Improved statistics for genome-wide interaction
      analysis.
    * The original --fast-epistasis test normally applies the variance and
      empty cell corrections suggested by Ueki and Cordell's paper.  To disable
      them, use the 'no-ueki' modifier.
    * 'case-only' requests a case-only instead of a case/control test.
    * By default, all pairs of variants across the entire genome are tested.
      To just test pairs of variants within a single set, add the 'set-by-set'
      modifier and load exactly one set with --set/--make-set; with exactly two
      sets loaded, all variants in one set are tested against all variants in
      the other.  'set-by-all' tests all variants in one set against the entire
      genome instead.
    * 'nop' strips p-values from the main report.
    * These computations can be subdivided with --parallel; however...
  --epistasis-summary-merge <common file prefix> <ct>
    When a --[fast-]epistasis job is subdivided with --parallel, the main
    report can be assembled at the end by applying Unix 'cat' in the usual
    manner, but the .summary.1, .summary.2, ... files may require a specialized
    merge.  --epistasis-summary-merge takes care of the latter.

  --twolocus <variant ID> <variant ID>
    Two-locus joint genotype count report.

  --score <filename> [i] [j] [k] ['header'] [{sum | no-sum}]
          [{no-mean-imputation | center}] ['include-cnt'] ['double-dosage']
    Apply a linear scoring system to each sample.
    The input file should have one line per scored variant.  Variant IDs are
    read from column #i, allele codes are read from column #j, and scores are
    read from column #k, where i defaults to 1, j defaults to i+1, and k
    defaults to j+1.
    * The 'header' modifier causes the first nonempty line of the input file to
      be ignored; otherwise, --score assumes there is no header line.
    * By default, final scores are averages of the valid per-variant scores.
      The 'sum' modifier causes sums to be reported instead.  (This cannot be
      used with 'no-mean-imputation'.  And for backward compatibility, 'sum' is
      automatically on with dosage data unless 'no-sum' is specified.)
    * By default, copies of the unnamed allele contribute zero to score, while
      missing genotypes contribute an amount proportional to the loaded (via
      --read-freq) or imputed allele frequency.  To throw out missing
      observations instead (decreasing the denominator in the final average
      when this happens), use the 'no-mean-imputation' modifier.
    * Alternatively, you can use the 'center' modifier to shift all scores to
      mean zero.
    * This command can be used with dosage data.  By default, the 'CNT' column
      is omitted from the output file in this case; use 'include-cnt' to keep
      it.  Also, note that scores are multiplied by 0..1 dosages, not 0..2
      diploid allele counts, unless the 'double-dosage' modifier is present.

  --R <R script file> ['debug']
    Connect to a Rserve (preferably version 1.7 or later) background process,
    and execute the Rplink function defined in the input file.  (Unless the
    'debug' modifier is present; in that case, the R commands that PLINK would
    have tried to execute are logged to a file.)

  --write-var-ranges <block ct>
    Divide the set of variants into equal-size blocks.  (Can be used with
    --snps to split a job across multiple machines.)

The following other flags are supported.  (Order of operations is described at
https://www.cog-genomics.org/plink/1.9/order .)
  --script <fname> : Include command-line options from file.
  --rerun [log]    : Rerun commands in log (default 'plink.log').
  --version        : Display only version number before exiting.
  --silent         : Suppress output to console.
  --gplink         : Reserved for interoperation with gPLINK.
  --missing-genotype <char> : Set missing genotype code (normally '0').
  --double-id          : Set both FIDs and IIDs to the VCF/BCF sample ID.
  --const-fid [ID]     : Set all FIDs to the given constant (default '0').
  --id-delim [d]       : Parse sample IDs as <FID><d><IID> (default delim '_').
  --vcf-idspace-to <c> : Convert spaces in sample IDs to the given character.
  --biallelic-only ['strict'] ['list'] : Skip VCF variants with 2+ ALT alleles.
  --vcf-min-qual <val>           : Skip VCF variants with low/missing QUAL.
  --vcf-filter [exception(s)...] : Skip variants which have FILTER failures.
  --vcf-require-gt               : Skip variants with no GT field.
  --vcf-min-gq <val>             : No-call a genotype when GQ is below the
                                   given threshold.
  --vcf-min-gp <val>             : No-call a genotype when 0-1 scaled GP is
                                   below the given threshold.
  --vcf-half-call <m>  : Specify how '0/.' and similar VCF GT values should be
                         handled.  The following four modes are supported:
                         * 'error'/'e' (default) errors out and reports line #.
                         * 'haploid'/'h' treats them as haploid calls.
                         * 'missing'/'m' treats them as missing.
                         * 'reference'/'r' treats the missing value as 0.
  --oxford-single-chr <chr nm> : Specify single-chromosome .gen file with
                                 ignorable first column.
  --oxford-pheno-name <col nm> : Import named phenotype from the .sample file.
  --hard-call-threshold <val>  : When an Oxford-format fileset is loaded, calls
  --hard-call-threshold random   with uncertainty level greater than 0.1 are
                                 normally treated as missing.  You can adjust
                                 this threshold by providing a numeric
                                 parameter, or randomize all calls with
                                 'random'.
  --missing-code [string list] : Comma-delimited list of missing phenotype
    (alias: --missing_code)      values for Oxford-format filesets (def. 'NA').
  --simulate-ncases <num>   : Set --simulate case count (default 1000).
  --simulate-ncontrols <n>  : Set --simulate control count (default 1000).
  --simulate-prevalence <p> : Set --simulate disease prevalence (default 0.01).
  --simulate-n <num>        : Set --simulate-qt sample count (default 1000).
  --simulate-label <prefix> : Set --simulate[-qt] FID/IID name prefix.
  --simulate-missing <freq> : Set --simulate[-qt] missing genotype frequency.
  --allow-extra-chr ['0']   : Permit unrecognized chromosome codes.  The '0'
    (alias: --aec)            modifier causes them to be treated as if they had
                              been set to zero.
  --chr-set <autosome ct> ['no-x'] ['no-y'] ['no-xy'] ['no-mt'] :
    Specify a nonhuman chromosome set.  The first parameter sets the number of
    diploid autosome pairs if positive, or haploid chromosomes if negative.
    Given diploid autosomes, the remaining modifiers indicate the absence of
    the named non-autosomal chromosomes.
  --cow/--dog/--horse/--mouse/--rice/--sheep : Shortcuts for those species.
  --autosome-num <value>    : Alias for "--chr-set <value> no-y no-xy no-mt".
  --cm-map <fname pattern> [chr] : Use SHAPEIT-format recombination maps to set
                                   centimorgan positions.  To process more than
                                   one chromosome, include a '@' in the first
                                   parameter where the chrom. number belongs,
                                   e.g. 'genetic_map_chr@_combined_b37.txt'.
  --zero-cms         : Zero out centimorgan positions.
  --allow-no-samples : Allow the input fileset to contain no samples.
  --allow-no-vars    : Allow the input fileset to contain no variants.
  --pheno <fname>  : Load phenotype data from the specified file, instead of
                     using the values in the main input fileset.
  --all-pheno      : For basic association tests, loop through all phenotypes
                     in --pheno file.
  --mpheno <n>     : Load phenotype from column (n+2) in --pheno file.
  --pheno-name <c> : If --pheno file has a header row, use column with the
                     given name.
  --pheno-merge    : When the main input fileset contains an phenotype value
                     for a sample, but the --pheno file does not, use the
                     original value instead of treating the phenotype as
                     missing.
  --missing-phenotype <v> : Set missing phenotype value (normally -9).
  --1                     : Expect case/control phenotypes to be coded as
                            0 = control, 1 = case, instead of the usual
                            0 = missing, 1 = control, 2 = case.  This also
                            forces phenotypes to be interpreted as case/ctrl.
  --make-pheno <fn> <val> : Define a new case/control phenotype.  If the val
                            parameter is '*', all samples listed in the given
                            file are cases, and everyone else is a control.
                            (Note that, in some shells, it is necessary to
                            surround the * with quotes.)
                            Otherwise, all samples with third column entry
                            equal to the val parameter are cases, and all other
                            samples mentioned in the file are controls.
  --tail-pheno <Lt> [Hbt] : Downcode a scalar phenotype to a case/control
                            phenotype.  All samples with phenotype values
                            greater than Hbt are cases, and all with values
                            less than or equal to Lt are controls.  If Hbt is
                            unspecified, it is equal to Lt; otherwise,
                            in-between phenotype values are set to missing.
  --covar <filename> ['keep-pheno-on-missing-cov'] : Specify covariate file.
  --covar-name <...>       : Specify covariate(s) in --covar file by name.
                             Separate multiple names with spaces or commas, and
                             use dashes to designate ranges.
  --covar-number <...>     : Specify covariate(s) in --covar file by index.
  --no-const-covar         : Exclude constant covariates.
  --allow-no-covars        : Allow no covariates to be loaded from --covar
                             file.
  --within <f> ['keep-NA'] : Specify initial cluster assignments.
  --mwithin <n>            : Load cluster assignments from column n+2.
  --family                 : Create a cluster for each family ID.
  --loop-assoc <f> ['keep-NA']  : Run specified case/control association
                                  commands once for each cluster in the file,
                                  using cluster membership as the phenotype.
  --set <filename>              : Load sets from a .set file.
  --set-names <name(s)...>      : Load only sets named on the command line.
                                  Use spaces to separate multiple names.
  --subset <filename>           : Load only sets named in the given text file.
  --set-collapse-all <set name> : Merge all sets.
  --complement-sets             : Invert all sets.  (Names gain 'C_' prefixes.)
  --make-set-complement-all <s> : --set-collapse-all + inversion.
  --make-set <filename>         : Define sets from a list of named bp ranges.
  --make-set-border <kbs>       : Stretch regions in --make-set file.
  --make-set-collapse-group     : Define sets from groups instead of sets in
                                  --make-set file.
  --keep <filename>       : Exclude all samples not named in the file.
  --remove <filename>     : Exclude all samples named in the file.
  --keep-fam <filename>   : Exclude all families not named in the file.
  --remove-fam <filename> : Exclude all families named in the file.
  --extract ['range'] <f> : Exclude all variants not named in the file.
  --exclude ['range'] <f> : Exclude all variants named in the file.
  --keep-clusters <filename>          : These can be used individually or in
  --keep-cluster-names <name(s)...>     combination to define a list of
                                        clusters to keep; all samples not in a
                                        cluster in that list are then excluded.
                                        Use spaces to separate cluster names
                                        for --keep-cluster-names.
  --remove-clusters <filename>        : Exclude all clusters named in the file.
  --remove-cluster-names <name(s)...> : Exclude the named clusters.
  --gene <sets...> : Exclude variants not in a set named on the command line.
                     (Separate multiple set names with spaces.)
  --gene-all       : Exclude variants which aren't a member of any set.  (PLINK
                     1.07 automatically did this under some circumstances.)
  --attrib <f> [att lst] : Given a file assigning attributes to variants, and a
  --attrib-indiv <f> [a]   comma-delimited list (with no whitespace) of
                           attribute names, remove variants/samples which are
                           either missing from the file or don't have any of
                           the listed attributes.  If some attribute names in
                           the list are preceded by '-', they are treated as
                           "negative match conditions" instead: variants with
                           at least one negative match attribute are removed.
                           The first character in the list cannot be a '-', due
                           to how command-line parsing works; add a comma in
                           front to get around this.
  --chr <chrs...>  : Exclude all variants not on the given chromosome(s).
                     Valid choices for humans are 0 (unplaced), 1-22, X, Y, XY,
                     and MT.  Separate multiple chromosomes with spaces and/or
                     commas, and use a dash (no adjacent spaces permitted) to
                     denote a range, e.g. "--chr 1-4, 22, xy".
  --not-chr <...>  : Reverse of --chr (exclude variants on listed chromosomes).
  --autosome       : Exclude all non-autosomal variants.
  --autosome-xy    : Exclude all non-autosomal variants, except those with
                     chromosome code XY (pseudo-autosomal region of X).
  --snps-only ['just-acgt'] : Exclude non-SNP variants.  By default, SNP = both
                              allele codes are single-character; 'just-acgt'
                              restricts codes to {A,C,G,T,a,c,g,t,<missing>}.
  --from <var ID>  : Use ID(s) to specify a variant range to load.  When used
  --to   <var ID>    together, both variants must be on the same chromosome.
  --snp  <var ID>  : Specify a single variant to load.
  --exclude-snp <> : Specify a single variant to exclude.
  --window  <kbs>  : With --snp or --exclude-snp, loads/excludes all variants
                     within half the specified kb distance of the named one.
  --from-bp <pos>  : Use physical position(s) to define a variant range to
  --to-bp   <pos>    load.  --from-kb/--to-kb/--from-mb/--to-mb allow decimal
  --from-kb <pos>    values.  You must also specify a single chromosome (using
  --to-kb   <pos>    e.g. --chr) when using these flags.
  --from-mb <pos>
  --to-mb   <pos>
  --snps <var IDs...>  : Use IDs to specify variant range(s) to load or
  --exclude-snps <...>   exclude.  E.g. "--snps rs1111-rs2222, rs3333, rs4444".
  --thin <p>       : Randomly remove variants, retaining each with prob. p.
  --thin-count <n> : Randomly remove variants until n of them remain.
  --bp-space <bps> : Remove variants so that each pair is no closer than the
                     given bp distance.  (Equivalent to VCFtools --thin.)
  --thin-indiv <p>         : Randomly remove samples, retaining with prob. p.
  --thin-indiv-count <n>   : Randomly remove samples until n of them remain.
  --filter <f> <val(s)...> : Exclude all samples without a 3rd column entry in
                             the given file matching one of the given
                             space-separated value(s).
  --mfilter <n>            : Match against (n+2)th column instead.
  --geno [val]     : Exclude variants with missing call frequencies greater
                     than a threshold (default 0.1).  (Note that the default
                     threshold is only applied if --geno is invoked without a
                     parameter; when --geno is not invoked, no per-variant
                     missing call frequency ceiling is enforced at all.  Other
                     inclusion/exclusion default thresholds work the same way.)
  --mind [val]     : Exclude samples with missing call frequencies greater than
                     a threshold (default 0.1).
  --oblig-missing <f1> <f2> : Specify blocks of missing genotype calls for
                              --geno/--mind to ignore.  The first file should
                              have variant IDs in the first column and block
                              IDs in the second, while the second file should
                              have FIDs in the first column, IIDs in the
                              second, and block IDs in the third.
  --prune             : Remove samples with missing phenotypes.
  --maf [freq]        : Exclude variants with minor allele frequency lower than
                        a threshold (default 0.01).
  --max-maf <freq>    : Exclude variants with MAF greater than the threshold.
  --mac <ct>          : Exclude variants with minor allele count lower than the
    (alias: --min-ac)   given threshold.
  --max-mac <ct>      : Exclude variants with minor allele count greater than
    (alias: --max-ac)   the given threshold.
  --maf-succ       : Rule of succession MAF estimation (used in EIGENSOFT).
                     Given j observations of one allele and k >= j observations
                     of the other, infer a MAF of (j+1) / (j+k+2), rather than
                     the default j / (j+k).
  --read-freq <fn> : Estimate MAFs and heterozygote frequencies from the given
                     --freq[x] report, instead of the input fileset.
  --hwe <p> ['midp'] ['include-nonctrl'] : Exclude variants with Hardy-Weinberg
                                           equilibrium exact test p-values
                                           below a threshold.
  --me <t> <v> ['var-first'] : Filter out trios and variants with Mendel error
                               rates exceeding the given thresholds.
  --me-exclude-one [ratio]   : Make --me exclude only one sample per trio.
  --qual-scores <f> [qcol] [IDcol] [skip] : Filter out variants with
                                            out-of-range quality scores.
                                            Default range is now [0, \infty ).
  --qual-threshold <min qual score>       : Set --qual-scores range floor.
  --qual-max-threshold <max qual score>   : Set --qual-scores range ceiling.
  --allow-no-sex   : Do not treat ambiguous-sex samples as having missing
                     phenotypes in analysis commands.  (Automatic /w --no-sex.)
  --must-have-sex  : Force ambiguous-sex phenotypes to missing on
                     --make-bed/--make-just-fam/--recode/--write-covar.
  --filter-cases       : Include only cases in the current analysis.
  --filter-controls    : Include only controls.
  --filter-males       : Include only males.
  --filter-females     : Include only females.
  --filter-founders    : Include only founders.
  --filter-nonfounders : Include only nonfounders.
  --nonfounders        : Include nonfounders in allele freq/HWE calculations.
  --make-founders ['require-2-missing'] ['first'] :
    Clear parental IDs for those with 1+ missing parent(s).
  --recode-allele <fn> : With --recode A/A-transpose/AD, count alleles named in
                         the file (otherwise A1 alleles are always counted).
  --output-chr <MT code> : Set chromosome coding scheme in output files by
                           providing the desired human mitochondrial code.
                           (Options are '26', 'M', 'MT', '0M', 'chr26', 'chrM',
                           and 'chrMT'.)
  --output-missing-genotype <ch> : Set the code used to represent missing
                                   genotypes in output files (normally the
                                   --missing-genotype value).
  --output-missing-phenotype <s> : Set the string used to represent missing
                                   phenotypes in output files (normally the
                                   --missing-phenotype value).
  --zero-cluster <f> : In combination with --within/--family, set blocks of
                       genotype calls to missing.  The input file should have
                       variant IDs in the first column and cluster IDs in the
                       second.  This must now be used with --make-bed and no
                       other output commands.
  --set-hh-missing       : Cause --make-bed and --recode to set heterozygous
                           haploid genotypes to missing.
  --set-mixed-mt-missing : Cause --make-bed and --recode to set mixed MT
                           genotypes to missing.
  --split-x <bp1> <bp2> ['no-fail']
  --split-x <build> ['no-fail'] :
    Changes chromosome code of all chrX variants with bp position <= bp1 or >=
    bp2 to XY.  The following build codes are supported as shorthand:
    * 'b36'/'hg18' = NCBI 36, 2709521/154584237
    * 'b37'/'hg19' = GRCh37, 2699520/154931044
    * 'b38'/'hg38' = GRCh38, 2781479/155701383
    By default, PLINK errors out when no variants would be affected by
    --split-x; the 'no-fail' modifier (useful in scripts) overrides this.
  --merge-x ['no-fail'] : Merge XY chromosome back with X.
  --set-me-missing      : Cause --make-bed to set Mendel errors to missing.
  --fill-missing-a2     : Cause --make-bed to replace all missing calls with
                          homozygous A2 calls.
  --set-missing-var-ids <t>   : Given a template string with a '@' where the
                                chromosome code should go and '#' where the bp
                                coordinate belongs, --set-missing-var-ids
                                assigns chromosome-and-bp-based IDs to unnamed
                                variants.
                                You may also use '$1' and '$2' to refer to
                                allele names in the template string, and in
                                fact this becomes essential when multiple
                                variants share the same coordinate.
  --new-id-max-allele-len <n> : Specify maximum number of leading characters
                                from allele names to include in new variant IDs
                                (default 23).
  --missing-var-code <string> : Change unnamed variant code (default '.').
  --update-chr  <f> [chrcol] [IDcol]  [skip] : Update variant chromosome codes.
  --update-cm   <f> [cmcol]  [IDcol]  [skip] : Update centimorgan positions.
  --update-map  <f> [bpcol]  [IDcol]  [skip] : Update variant bp positions.
  --update-name <f> [newcol] [oldcol] [skip] : Update variant IDs.
  --update-alleles <fname> : Update variant allele codes.
  --allele1234 ['multichar'] : Interpret/recode A/C/G/T alleles as 1/2/3/4.
                               With 'multichar', converts all A/C/G/Ts in
                               allele names to 1/2/3/4s.
  --alleleACGT ['multichar'] : Reverse of --allele1234.
  --update-ids <f>     : Update sample IDs.
  --update-parents <f> : Update parental IDs.
  --update-sex <f> [n] : Update sexes.  Sex (1 or M = male, 2 or F = female, 0
                         = missing) is loaded from column n+2 (default n is 1).
  --flip <filename>    : Flip alleles (A<->T, C<->G) for SNP IDs in the file.
  --flip-subset <fn>   : Only apply --flip to samples in --flip-subset file.
  --flip-scan-window <ct+1> : Set --flip-scan max variant ct dist. (def. 10).
  --flip-scan-window-kb <x> : Set --flip-scan max kb distance (default 1000).
  --flip-scan-threshold <x> : Set --flip-scan min correlation (default 0.5).
  --keep-allele-order  : Keep the allele order defined in the .bim file,
  --real-ref-alleles     instead of forcing A2 to be the major allele.
                         --real-ref-alleles also removes 'PR' from the INFO
                         values emitted by --recode vcf{,-fid,-iid}.
  --a1-allele <f> [a1col] [IDcol] [skip] : Force alleles in the file to A1.
  --a2-allele <filename> [a2col] [IDcol] [skip] :
    Force alleles in the file to A2.  ("--a2-allele <VCF filename> 4 3 '#'",
    which scrapes reference allele assignments from a VCF file, is especially
    useful.)
  --indiv-sort <m> [f] : Specify FID/IID sort order.  The following four modes
                         are supported:
                         * 'none'/'0' keeps samples in the order they were
                           loaded.  Default for non-merge operations.
                         * 'natural'/'n' invokes 'natural sort', e.g.
                           'id2' < 'ID3' < 'id10'.  Default when merging.
                         * 'ascii'/'a' sorts in ASCII order, e.g.
                           'ID3' < 'id10' < 'id2'.
                         * 'file'/'f' uses the order in the given file (named
                           in the second parameter).
                         For now, only --merge/--bmerge/--merge-list and
                         --make-bed/--make-just-fam respect this flag.
  --with-phenotype ['no-parents'] [{no-sex | female-2}] :
    Include more sample info in new .cov file.
  --dummy-coding [N] ['no-round'] : Split categorical variables (n categories,
                                    2 < n <= N, default N is 49) into n-1
                                    binary dummy variables when writing
                                    covariate file.
  --merge-mode <n>   : Adjust --[b]merge/--merge-list behavior based on a
                       numeric code.
                       1 (default) = ignore missing calls, otherwise difference
                                     -> missing
                       2 = only overwrite originally missing calls
                       3 = only overwrite when nonmissing in new file
                       4/5 = never overwrite and always overwrite, respectively
                       6 = report all mismatching calls without merging
                       7 = report mismatching nonmissing calls without merging
  --merge-equal-pos  : With --merge/--bmerge/--merge-list, merge variants with
                       different names but identical positions.  (Exception:
                       same-position chromosome code 0 variants aren't merged.)
  --mendel-duos      : Make Mendel error checks consider samples with only one
                       parent in the dataset.
  --mendel-multigen  : Make Mendel error checks consider (great-)grandparental
                       genotypes when parental genotype data is missing.
  --ld-window <ct+1> : Set --r/--r2 max variant ct pairwise distance (usu. 10).
  --ld-window-kb <x> : Set --r/--r2 max kb pairwise distance (usually 1000).
  --ld-window-cm <x> : Set --r/--r2 max centimorgan pairwise distance.
  --ld-window-r2 <x> : Set threshold for --r2 report inclusion (usually 0.2).
  --ld-snp <var ID>  : Set first variant in all --r/--r2 pairs.
  --ld-snps <vID...> : Restrict first --r/--r2 variant to the given ranges.
  --ld-snp-list <f>  : Restrict first --r/--r2 var. to those named in the file.
  --list-all         : Generate the 'all' mode report when using --show-tags in
                       file mode.
  --tag-kb <kbs>     : Set --show-tags max tag kb distance (default 250).
  --tag-r2 <val>     : Set --show-tags min tag r-squared (default 0.8)
  --tag-mode2        : Use two-column --show-tags (file mode) I/O format.
  --ld-xchr <code>   : Set chrX model for --indep[-pairwise], --r/--r2,
                       --flip-scan, and --show-tags.
                       1 (default) = males coded 0/1, females 0/1/2 (A1 dosage)
                       2 = males coded 0/2
                       3 = males coded 0/2, but females given double weighting
  --blocks-max-kb <kbs>      : Set --blocks maximum haploblock span (def. 200).
  --blocks-min-maf <cutoff>  : Adjust --blocks MAF minimum (default 0.05).
  --blocks-strong-lowci <x>  : Set --blocks "strong LD" CI thresholds (defaults
  --blocks-strong-highci <x>   0.70 and 0.98).
  --blocks-recomb-highci <x> : Set 'recombination' CI threshold (default 0.90).
  --blocks-inform-frac <x>   : Force haploblock <strong LD pairs>:<total
                               informative pairs> ratios to be larger than this
                               value (default 0.95).
  --distance-wts exp=<x>        : When computing genomic distances, assign each
                                  variant a weight of (2q(1-q))^{-x}, where q
                                  is the loaded or inferred MAF.
  --read-dists <dist file> [id file] : Load a triangular binary distance matrix
                                       instead of recalculating from scratch.
  --ppc-gap <val>    : Minimum number of base pairs, in thousands, between
                       informative pairs of markers used in --genome PPC test.
                       500 if unspecified.
  --min <cutoff>     : Specify minimum PI_HAT for inclusion in --genome report.
  --max <cutoff>     : Specify maximum PI_HAT for inclusion in --genome report.
  --homozyg-match <> : Set minimum concordance across jointly homozygous
                       variants for a pairwise allelic match to be declared.
  --pool-size <ct>   : Set minimum size of pools in "--homozyg group" report.
  --read-genome <fn> : Load --genome report for --cluster/--neighbour, instead
                       of recalculating IBS and PPC test p-values from scratch.
  --ppc <p-val>    : Specify minimum PPC test p-value within a cluster.
  --mc <max size>  : Specify maximum cluster size.
  --mcc <c1> <c2>  : Specify maximum case and control counts per cluster.
  --K <min count>  : Specify minimum cluster count.
  --ibm <val>      : Specify minimum identity-by-missingness.
  --match <f> [mv] : Use covariate values to restrict clustering.  Without
                     --match-type, two samples can only be in the same cluster
                     if all covariates match.  The optional second parameter
                     specifies a covariate value to treat as missing.
  --match-type <f> : Refine interpretation of --match file.  The --match-type
                     file is expected to be a single line with as many entries
                     as the --match file has covariates; '0' entries specify
                     "negative matches" (i.e. samples with equal covariate
                     values cannot be in the same cluster), '1' entries specify
                     "positive matches" (default), and '-1' causes the
                     corresponding covariate to be ignored.
  --qmatch <f> [m] : Force all members of a cluster to have similar
  --qt <fname>       quantitative covariate values.  The --qmatch file contains
                     the covariate values, while the --qt file is a list of
                     nonnegative tolerances (and '-1's marking covariates to
                     skip).
  --pca-cluster-names <...> : These can be used individually or in combination
  --pca-clusters <fname>      to define a list of clusters to use in the basic
                              --pca computation.  (--pca-cluster-names expects
                              a space-delimited sequence of cluster names,
                              while --pca-clusters expects a file with one
                              cluster name per line.)  All samples outside
                              those clusters will then be projected on to the
                              calculated PCs.
  --mds-plot <dims> ['by-cluster'] ['eigendecomp'] ['eigvals'] :
    Multidimensional scaling analysis.  Requires --cluster.
  --cell <thresh>  : Skip some --model tests when a contingency table entry is
                     smaller than the given threshold.
  --condition <var ID> [{dominant | recessive}] : Add one variant as a --linear
                                                  or --logistic covariate.
  --condition-list <f> [{dominant | recessive}] : Add variants named in the
                                                  file as --linear/--logistic
                                                  covariates.
  --parameters <...>  : Include only the given covariates/interactions in the
                        --linear/--logistic models, identified by a list of
                        1-based indices and/or ranges of them.
  --tests <all> [...] : Perform a (joint) test on the specified term(s) in the
                        --linear/--logistic model, identified by 1-based
                        indices and/or ranges of them.  If permutation was
                        requested, it is based on this test.
                        * Note that, when --parameters is also present, the
                          indices refer to the terms remaining AFTER pruning by
                          --parameters.
                        * You can use "--tests all" to include all terms.
  --vif <max VIF>     : Set VIF threshold for --linear multicollinearity check
                        (default 50).
  --xchr-model <code> : Set the X chromosome --linear/--logistic model.
                        0 = skip sex and haploid chromosomes
                        1 (default) = add sex as a covariate on X chromosome
                        2 = code male genotypes 0/2 instead of 0/1
                        3 = test for interaction between genotype and sex
  --lasso-select-covars [cov(s)...] : Subject some or all covariates to LASSO
                                      model selection.
  --adjust ['gc'] ['log10'] ['qq-plot'] : Report some multiple-testing
                                          corrections.
  --lambda <val>   : Set genomic control lambda for --adjust.
  --ci <size>      : Report confidence intervals for odds ratios.
  --pfilter <val>  : Filter out association test results with higher p-values.
  --aperm <min perms - 1> [max perms] [alpha] [beta] [init interval] [slope] :
    Set up to six parameters controlling adaptive permutation tests.
    * The first two control the minimum and maximum number of permutations that
      may be run for each variant; default values are 5 and 1000000.
    * The next two control the early termination condition.  A
      100% * (1 - beta/2T) confidence interval is calculated for each empirical
      p-value, where T is the total number of variants; whenever this
      confidence interval doesn't contain alpha, the variant is exempted from
      further permutation testing.  Default values are 0 and 1e-4.
    * The last two control when the early termination condition is checked.  If
      a check occurs at permutation #p, the next check occurs after
      <slope>p + <init interval> more permutations (rounded down).  Default
      initial interval is 1, and default slope is 0.001.
  --mperm-save     : Save best max(T) permutation test statistics.
  --mperm-save-all : Save all max(T) permutation test statistics.
  --set-p <p-val>        : Adjust set test significant variant p-value ceiling
                           (default 0.05).
  --set-r2 [v] ['write'] : Adjust set test significant variant pairwise r^2
                           ceiling (default 0.5).  'write' causes violating
                          pairs to be dumped to <output prefix>.ldset.
  --set-max <ct>         : Adjust set test maximum # of significant variants
                           considered per set (default 5).
  --set-test-lambda <v>  : Specify genomic control correction for set test.
  --border <kbs>            : Extend --annotate range intervals by given # kbs.
  --annotate-snp-field <nm> : Set --annotate variant ID field name.
  --clump-p1 <pval> : Set --clump index var. p-value ceiling (default 1e-4).
  --clump-p2 <pval> : Set --clump secondary p-value threshold (default 0.01).
  --clump-r2 <r^2>  : Set --clump r^2 threshold (default 0.5).
  --clump-kb <kbs>  : Set --clump kb radius (default 250).
  --clump-snp-field <n...>  : Set --clump variant ID field name (default
                              'SNP').  With multiple field names, earlier names
                              take precedence over later ones.
  --clump-field <name...>   : Set --clump p-value field name (default 'P').
  --clump-allow-overlap     : Let --clump non-index vars. join multiple clumps.
  --clump-verbose           : Request extended --clump report.
  --clump-annotate <hdr...> : Include named extra fields in --clump-verbose and
                              --clump-best reports.  (Field names can be
                              separated with spaces or commas.)
  --clump-range <filename>  : Report overlaps between clumps and regions.
  --clump-range-border <kb> : Stretch regions in --clump-range file.
  --clump-index-first       : Extract --clump index vars. from only first file.
  --clump-replicate         : Exclude clumps which contain secondary results
                              from only one file.
  --clump-best              : Report best proxy for each --clump index var.
  --meta-analysis-chr-field <n...> : Set --meta-analysis chromosome, variant
  --meta-analysis-snp-field <n...>   ID, position, A1/A2 allele, p-value,
  --meta-analysis-bp-field <n...>    standard error, and/or effective sample
  --meta-analysis-a1-field <n...>    size field names.
  --meta-analysis-a2-field <n...>    Defaults are 'CHR', 'SNP', 'BP', 'A1',
  --meta-analysis-p-field <n...>     'A2', 'P', 'SE', and 'NMISS',
  --meta-analysis-se-field <n...>    respectively.  When multiple parameters
  --meta-analysis-ess-field <n...>   are given to these flags, earlier names
                                     take precedence over later ones.
                                     Note that, if the numbers of cases and
                                     controls are unequal, effective sample
                                     size should be
                                       4 / (1/<# cases> + 1/<# controls>).
  --meta-analysis-report-dups      : When a variant appears multiple times in
                                     in the same file, report that.
  --gene-list-border <kbs>   : Extend --gene-report regions by given # of kbs.
  --gene-subset <filename>   : Specify gene name subset for --gene-report.
  --gene-report-snp-field <> : Set --gene-report variant ID field name (default
                               'SNP').  Only relevant with --extract.
  --gap <kbs>      : Set "--fast-epistasis case-only" min. gap (default 1000).
  --epi1 <p-value> : Set --[fast-]epistasis reporting threshold (default
                     5e-6 for 'boost', 1e-4 otherwise).
  --epi2 <p-value> : Set threshold for contributing to SIG_E count (def. 0.01).
  --je-cellmin <n> : Set required number of observations per 3x3x2 contingency
                     table cell for joint-effects test (default 5).
  --q-score-range <range file> <data file> [i] [j] ['header'] :
    Apply --score to subset(s) of variants in the primary score list based
    on e.g. p-value ranges.
    * The first file should have range labels in the first column, p-value
      lower bounds in the second column, and upper bounds in the third column.
      Lines with too few entries, or nonnumeric values in the second or third
      column, are ignored.
    * The second file should contain a variant ID and a p-value on each
      nonempty line (except possibly the first).  Variant IDs are read from
      column #i and p-values are read from column #j, where i defaults to 1 and
      j defaults to i+1.  The 'header' modifier causes the first nonempty line
      of this file to be skipped.
  --R-port <port #>  : Connect to Rserve on a port other than 6311.
  --R-host <host>    : Connect to Rserve host.
  --R-socket <sock>  : Connect to Rserve socket.
  --parallel <k> <n> : Divide the output matrix into n pieces, and only compute
                       the kth piece.  The primary output file will have the
                       piece number included in its name, e.g. plink.rel.13 or
                       plink.rel.13.gz if k is 13.  Concatenating these files
                       in order will yield the full matrix of interest.  (Yes,
                       this can be done before unzipping.)
                       N.B. This generally cannot be used to directly write a
                       symmetric square matrix.  Choose square0 or triangle
                       shape instead, and postprocess as necessary.
  --memory <val>     : Set size, in MB, of initial workspace malloc attempt.
                       (Practically mandatory when using GNU parallel.)
  --threads <val>    : Set maximum number of concurrent threads.
                       This has one known limitation: some BLAS/LAPACK linear
                       algebra operations are multithreaded in a way that PLINK
                       cannot control.  If this is problematic, you should
                       recompile against single-threaded BLAS/LAPACK.
  --d <char>         : Change variant/covariate range delimiter (normally '-').
  --seed <val...>    : Set random number seed(s).  Each value must be an
                       integer between 0 and 4294967295 inclusive.
  --perm-batch-size <val> : Set number of permutations per batch for some
                            permutation tests.
  --native           : Allow Intel MKL to use processor-dependent code paths.
  --output-min-p <p> : Specify minimum p-value to write to reports.
  --debug            : Use slower, more crash-resistant logging method.

Primary methods paper:
Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ (2015)
Second-generation PLINK: rising to the challenge of larger and richer datasets.
GigaScience, 4.

For further documentation and support, consult the main webpage
(https://www.cog-genomics.org/plink/1.9 ) and/or the mailing list
(https://groups.google.com/d/forum/plink2-users ).
PLINK v1.90b6.21 64-bit (19 Oct 2020)          www.cog-genomics.org/plink/1.9/
(C) 2005-2020 Shaun Purcell, Christopher Chang   GNU General Public License v3

In the command line flag definitions that follow,
  * <angle brackets> denote a required parameter, where the text between the
    angle brackets describes its nature.
  * ['square brackets + single-quotes'] denotes an optional modifier.  Use the
    EXACT text in the quotes.
  * [{bar|separated|braced|bracketed|values}] denotes a collection of mutually
    exclusive optional modifiers (again, the exact text must be used).  When
    there are no outer square brackets, one of the choices must be selected.
  * ['quoted_text='<description of value>] denotes an optional modifier that
    must begin with the quoted text, and be followed by a value with no
    whitespace in between.  '|' may also be used here to indicate mutually
    exclusive options.
  * [square brackets without quotes or braces] denote an optional parameter,
    where the text between the brackets describes its nature.
  * An ellipsis (...) indicates that you may enter multiple parameters of the
    specified type.

  plink <input flag(s)...> [command flag(s)...] [other flag(s)...]
  plink --help [flag name(s)...]

Most PLINK runs require exactly one main input fileset.  The following flags
are available for defining its form and location:

  --bfile [prefix] : Specify .bed + .bim + .fam prefix (default 'plink').
  --bed <filename> : Specify full name of .bed file.
  --bim <filename> : Specify full name of .bim file.
  --fam <filename> : Specify full name of .fam file.

  --keep-autoconv  : With --file/--tfile/--lfile/--vcf/--bcf/--data/--23file,
                     don't delete autogenerated binary fileset at end of run.

  --file [prefix]  : Specify .ped + .map filename prefix (default 'plink').
  --ped <filename> : Specify full name of .ped file.
  --map <filename> : Specify full name of .map file.

  --no-fid         : .fam/.ped file does not contain column 1 (family ID).
  --no-parents     : .fam/.ped file does not contain columns 3-4 (parents).
  --no-sex         : .fam/.ped file does not contain column 5 (sex).
  --no-pheno       : .fam/.ped file does not contain column 6 (phenotype).

  --tfile [prefix] : Specify .tped + .tfam filename prefix (default 'plink').
  --tped <fname>   : Specify full name of .tped file.
  --tfam <fname>   : Specify full name of .tfam file.

  --lfile [prefix] : Specify .lgen + .map + .fam (long-format fileset) prefix.
  --lgen <fname>   : Specify full name of .lgen file.
  --reference <fn> : Specify default allele file accompanying .lgen input.
  --allele-count   : When used with --lfile/--lgen + --reference, specifies
                     that the .lgen file contains reference allele counts.

  --vcf <filename> : Specify full name of .vcf or .vcf.gz file.
  --bcf <filename> : Specify full name of BCF2 file.

  --data [prefix]  : Specify Oxford .gen + .sample prefix (default 'plink').
  --gen <filename> : Specify full name of .gen or .gen.gz file.
  --bgen <f> ['snpid-chr'] : Specify full name of .bgen file.
  --sample <fname> : Specify full name of .sample file.

  --23file <fname> [FID] [IID] [sex] [pheno] [pat. ID] [mat. ID] :
    Specify 23andMe input file.

  --grm-gz [prfx]  : Specify .grm.gz + .grm.id (GCTA rel. matrix) prefix.
  --grm-bin [prfx] : Specify .grm.bin + .grm.N.bin + .grm.id (GCTA triangular
                     binary relationship matrix) filename prefix.

  --dummy <sample ct> <SNP ct> [missing geno freq] [missing pheno freq]
          [{acgt | 1234 | 12}] ['scalar-pheno']
    This generates a fake input dataset with the specified number of samples
    and SNPs.  By default, the missing genotype and phenotype frequencies are
    zero, and genotypes are As and Bs (change the latter with
    'acgt'/'1234'/'12').  The 'scalar-pheno' modifier causes a normally
    distributed scalar phenotype to be generated instead of a binary one.

  --simulate <simulation parameter file> [{tags | haps}] [{acgt | 1234 | 12}]
  --simulate-qt <sim. parameter file> [{tags | haps}] [{acgt | 1234 | 12}]
    --simulate generates a fake input dataset with disease-associated SNPs,
    while --simulate-qt generates a dataset with quantitative trait loci.

Output files have names of the form 'plink.<extension>' by default.  You can
change the 'plink' prefix with

  --out <prefix>   : Specify prefix for output files.

Most runs also require at least one of the following commands:

  --make-bed
    Create a new binary fileset.  Unlike the automatic text-to-binary
    converters (which only heed chromosome filters), this supports all of
    PLINK's filtering flags.
  --make-just-bim
  --make-just-fam
    Variants of --make-bed which only write a new .bim or .fam file.  Can be
    used with only .bim/.fam input.
    USE THESE CAUTIOUSLY.  It is very easy to desynchronize your binary
    genotype data and your .bim/.fam indexes if you use these commands
    improperly.  If you have any doubt, stick with --make-bed.

  --recode <output format> [{01 | 12}] [{tab | tabx | spacex | bgz | gen-gz}]
           ['include-alt'] ['omit-nonmale-y']
    Create a new text fileset with all filters applied.  The following output
    formats are supported:
    * '23': 23andMe 4-column format.  This can only be used on a single
      sample's data (--keep may be handy), and does not support multicharacter
      allele codes.
    * 'A': Sample-major additive (0/1/2) coding, suitable for loading from R.
      If you need uncounted alleles to be named in the header line, add the
      'include-alt' modifier.
    * 'AD': Sample-major additive (0/1/2) + dominant (het=1/hom=0) coding.
      Also supports 'include-alt'.
    * 'A-transpose': Variant-major 0/1/2.
    * 'beagle': Unphased per-autosome .dat and .map files, readable by early
      BEAGLE versions.
    * 'beagle-nomap': Single .beagle.dat file.
    * 'bimbam': Regular BIMBAM format.
    * 'bimbam-1chr': BIMBAM format, with a two-column .pos.txt file.  Does not
      support multiple chromosomes.
    * 'fastphase': Per-chromosome fastPHASE files, with
      .chr-<chr #>.recode.phase.inp filename extensions.
    * 'fastphase-1chr': Single .recode.phase.inp file.  Does not support
      multiple chromosomes.
    * 'HV': Per-chromosome Haploview files, with .chr-<chr #>{.ped,.info}
      filename extensions.
    * 'HV-1chr': Single Haploview .ped + .info file pair.  Does not support
      multiple chromosomes.
    * 'lgen': PLINK 1 long-format (.lgen + .fam + .map), loadable with --lfile.
    * 'lgen-ref': .lgen + .fam + .map + .ref, loadable with --lfile +
       --reference.
    * 'list': Single genotype-based list, up to 4 lines per variant.  To omit
      nonmale genotypes on the Y chromosome, add the 'omit-nonmale-y' modifier.
    * 'rlist': .rlist + .fam + .map fileset, where the .rlist file is a
      genotype-based list which omits the most common genotype for each
      variant.  Also supports 'omit-nonmale-y'.
    * 'oxford': Oxford-format .gen + .sample.  With the 'gen-gz' modifier, the
      .gen file is gzipped.
    * 'ped': PLINK 1 sample-major (.ped + .map), loadable with --file.
    * 'compound-genotypes': Same as 'ped', except that the space between each
      pair of same-variant allele codes is removed.
    * 'structure': Structure-format.
    * 'transpose': PLINK 1 variant-major (.tped + .tfam), loadable with
      --tfile.
    * 'vcf', 'vcf-fid', 'vcf-iid': VCFv4.2.  'vcf-fid' and 'vcf-iid' cause
      family IDs or within-family IDs respectively to be used for the sample
      IDs in the last header row, while 'vcf' merges both IDs and puts an
      underscore between them.  If the 'bgz' modifier is added, the VCF file is
      block-gzipped.
      The A2 allele is saved as the reference and normally flagged as not based
      on a real reference genome (INFO:PR).  When it is important for reference
      alleles to be correct, you'll also want to include --a2-allele and
      --real-ref-alleles in your command.
    In addition,
    * The '12' modifier causes A1 (usually minor) alleles to be coded as '1'
      and A2 alleles to be coded as '2', while '01' maps A1 -> 0 and A2 -> 1.
    * The 'tab' modifier makes the output mostly tab-delimited instead of
      mostly space-delimited.  'tabx' and 'spacex' force all tabs and all
      spaces, respectively.

  --flip-scan ['verbose']
    (alias: --flipscan)
    LD-based scan for case/control strand inconsistency.

  --write-covar
    If a --covar file is loaded, --make-bed/--make-just-fam and --recode
    automatically generate an updated version (with all filters applied).
    However, if you do not wish to simultaneously generate a new genotype file,
    you can use --write-covar to just produce a pruned covariate file.

  --write-cluster ['omit-unassigned']
    If clusters are specified with --within/--family, this generates a new
    cluster file (with all filters applied).  The 'omit-unassigned' modifier
    causes unclustered samples to be omitted from the file; otherwise their
    cluster is 'NA'.

  --write-set
  --set-table
    If sets have been defined, --write-set dumps 'END'-terminated set
    membership lists to <output prefix>.set, while --set-table writes a
    variant-by-set membership table to <output prefix>.set.table.

  --merge <.ped filename> <.map filename>
  --merge <text fileset prefix>
  --bmerge <.bed filename> <.bim filename> <.fam filename>
  --bmerge <binary fileset prefix>
    Merge the given fileset with the initially loaded fileset, writing the
    result to <output prefix>.bed + .bim + .fam.  (It is no longer necessary to
    simultaneously specify --make-bed.)
  --merge-list <filename>
    Merge all filesets named in the text file with the reference fileset, if
    one was specified.  (However, this can also be used *without* a reference;
    in that case, the newly created fileset is then treated as the reference by
    most other PLINK operations.)  The text file is interpreted as follows:
    * If a line contains only one name, it is assumed to be the prefix for a
      binary fileset.
    * If a line contains exactly two names, they are assumed to be the full
      filenames for a text fileset (.ped first, then .map).
    * If a line contains exactly three names, they are assumed to be the full
      filenames for a binary fileset (.bed, then .bim, then .fam).

  --write-snplist
  --list-23-indels
    --write-snplist writes a .snplist file listing the names of all variants
    which pass the filters and inclusion thresholds you've specified, while
    --list-23-indels writes the subset with 23andMe-style indel calls (D/I
    allele codes).

  --list-duplicate-vars ['require-same-ref'] ['ids-only'] ['suppress-first']
    --list-duplicate-vars writes a .dupvar file describing all groups of
    variants with matching positions and allele codes.
    * By default, A1/A2 allele assignments are ignored; use 'require-same-ref'
      to override this.
    * Normally, the report contains position and allele codes.  To remove them
      (and produce a file directly usable with e.g. --extract/--exclude), use
      'ids-only'.  Note that this command will fail in 'ids-only' mode if any
      of the reported IDs are not unique.
    * 'suppress-first' causes the first variant ID in each group to be omitted
      from the report.

  --freq [{counts | case-control}] ['gz']
  --freqx ['gz']
    --freq generates a basic allele frequency (or count, if the 'counts'
    modifier is present) report.  This can be combined with --within/--family
    to produce a cluster-stratified allele frequency/count report instead, or
    the 'case-control' modifier to report case and control allele frequencies
    separately.
    --freqx generates a more detailed genotype count report, designed for use
    with --read-freq.

  --missing ['gz']
    Generate sample- and variant-based missing data reports.  If clusters are
    defined, the variant-based report is cluster-stratified.  'gz' causes the
    output files to be gzipped.
    Unlike most other commands, this doesn't treat het. haploids as missing.

  --test-mishap
    Check for association between missing calls and flanking haplotypes.

  --hardy ['midp'] ['gz']
    Generate a Hardy-Weinberg exact test p-value report.  (This does NOT
    simultaneously filter on the p-value any more; use --hwe for that.)  With
    the 'midp' modifier, the test applies the mid-p adjustment described in
    Graffelman J, Moreno V (2013) The mid p-value in exact tests for
    Hardy-Weinberg Equilibrium.

  --mendel ['summaries-only']
    Generate a Mendel error report.  The 'summaries-only' modifier causes the
    .mendel file (listing every single error) to be skipped.

  --het ['small-sample'] ['gz']
  --ibc
    Estimate inbreeding coefficients.  --het reports method-of-moments
    estimates, while --ibc calculates all three values described in Yang J, Lee
    SH, Goddard ME and Visscher PM (2011) GCTA: A Tool for Genome-wide Complex
    Trait Analysis.  (That paper also describes the relationship matrix
    computation we reimplement.)
    * These functions require decent MAF estimates.  If there are very few
      samples in your immediate fileset, --read-freq is practically mandatory
      since imputed MAFs are wildly inaccurate in that case.
    * They also assume the marker set is in approximate linkage equilibrium.
    * By default, --het omits the n/(n-1) multiplier in Nei's expected
      homozygosity formula.  The 'small-sample' modifier causes it to be
      included, while forcing --het to use MAFs imputed from founders in the
      immediate dataset.

  --check-sex [female max F] [male min F]
  --check-sex ycount [female max F] [male min F] [female max Y obs]
                     [male min Y obs]
  --check-sex y-only [female max Y obs] [male min Y obs]
  --impute-sex [female max F] [male min F]
  --impute-sex ycount [female max F] [male min F] [female max Y obs]
                      [male min Y obs]
  --impute-sex y-only [female max Y obs] [male min Y obs]
    --check-sex normally compares sex assignments in the input dataset with
    those imputed from X chromosome inbreeding coefficients.
    * Make sure that the X chromosome pseudo-autosomal region has been split
      off (with e.g. --split-x) before using this.
    * You also need decent MAF estimates (so, with very few samples in your
      immediate fileset, use --read-freq), and your marker set should be in
      approximate linkage equilibrium.
    * By default, F estimates smaller than 0.2 yield female calls, and values
      larger than 0.8 yield male calls.  If you pass numeric parameter(s) to
      --check-sex, the first two control these thresholds.
    There are now two modes which consider Y chromosome data.
    * In 'ycount' mode, gender is still imputed from the X chromosome, but
      female calls are downgraded to ambiguous whenever more than 0 nonmissing
      Y genotypes are present, and male calls are downgraded when fewer than 0
      are present.  (Note that these are counts, not rates.)  These thresholds
      are controllable with --check-sex ycount's optional 3rd and 4th numeric
      parameters.
    * In 'y-only' mode, gender is imputed from nonmissing Y genotype counts.
      The male minimum threshold defaults to 1 instead of zero in this case.
    --impute-sex changes sex assignments to the imputed values, and is
    otherwise identical to --check-sex.  It must be used with
    --make-bed/--recode/--write-covar.

  --fst ['case-control']
    (alias: --Fst)
    Estimate Wright's Fst for each autosomal diploid variant using the method
    introduced in Weir BS, Cockerham CC (1984) Estimating F-statistics for the
    analysis of population structure, given a set of subpopulations defined via
    --within.  Raw and weighted global means are also reported.
    * If you're interested in the global means, it is usually best to perform
      this calculation on a marker set in approximate linkage equilibrium.
    * If you have only two subpopulations, you can represent them with
      case/control status and use the 'case-control' modifier.

  --indep <window size>['kb'] <step size (variant ct)> <VIF threshold>
  --indep-pairwise <window size>['kb'] <step size (variant ct)> <r^2 threshold>
  --indep-pairphase <window size>['kb'] <step size (variant ct)> <r^2 thresh>
    Generate a list of markers in approximate linkage equilibrium.  With the
    'kb' modifier, the window size is in kilobase instead of variant count
    units.  (Pre-'kb' space is optional, i.e. "--indep-pairwise 500 kb 5 0.5"
    and "--indep-pairwise 500kb 5 0.5" have the same effect.)
    Note that you need to rerun PLINK using --extract or --exclude on the
    .prune.in/.prune.out file to apply the list to another computation.

  --r [{square | square0 | triangle | inter-chr}] [{gz | bin | bin4}]
      ['spaces'] ['in-phase'] [{d | dprime | dprime-signed}] ['with-freqs']
      ['yes-really']
  --r2 [{square | square0 | triangle | inter-chr}] [{gz | bin | bin4}]
       ['spaces'] ['in-phase'] [{d | dprime | dprime-signed}] ['with-freqs']
       ['yes-really']
    LD statistic reports.  --r yields raw inter-variant correlations, while
    --r2 reports their squares.  You can request results for all pairs in
    matrix format (if you specify 'bin' or one of the shape modifiers), all
    pairs in table format ('inter-chr'), or a limited window in table format
    (default).
    * The 'gz' modifier causes the output text file to be gzipped.
    * 'bin' causes the output matrix to be written in double-precision binary
      format, while 'bin4' specifics single-precision binary.  The matrix is
      square if no shape is explicitly specified.
    * By default, text matrices are tab-delimited; 'spaces' switches this.
    * 'in-phase' adds a column with in-phase allele pairs to table-formatted
      reports.  (This cannot be used with very long allele codes.)
    * 'dprime' adds the absolute value of Lewontin's D-prime statistic to
      table-formatted reports, and forces both r/r^2 and D-prime to be based on
      the maximum likelihood solution to the cubic equation discussed in Gaunt
      T, Rodriguez S, Day I (2007) Cubic exact solutions for the estimation of
      pairwise haplotype frequencies.
      'dprime-signed' keeps the sign, while 'd' skips division by D_{max}.
    * 'with-freqs' adds MAF columns to table-formatted reports.
    * Since the resulting file can easily be huge, you're required to add the
      'yes-really' modifier when requesting an unfiltered, non-distributed all
      pairs computation on more than 400k variants.
    * These computations can be subdivided with --parallel (even when the
      'square' modifier is active).
  --ld <variant ID> <variant ID> ['hwe-midp']
    This displays haplotype frequencies, r^2, and D' for a single pair of
    variants.  When there are multiple biologically possible solutions to the
    haplotype frequency cubic equation, all are displayed (instead of just the
    maximum likelihood solution identified by --r/--r2), along with HWE exact
    test statistics.

  --show-tags <filename>
  --show-tags all
    * If a file is specified, list all variants which tag at least one variant
      named in the file.  (This will normally be a superset of the original
      list, since a variant is considered to tag itself here.)
    * If 'all' mode is specified, for each variant, each *other* variant which
      tags it is reported.

  --blocks ['no-pheno-req'] ['no-small-max-span']
    Estimate haplotype blocks, via Haploview's interpretation of the block
    definition suggested by Gabriel S et al. (2002) The Structure of Haplotype
    Blocks in the Human Genome.
    * Normally, samples with missing phenotypes are not considered by this
      computation; the 'no-pheno-req' modifier lifts this restriction.
    * Normally, size-2 blocks may not span more than 20kb, and size-3 blocks
      are limited to 30kb.  The 'no-small-max-span' modifier removes these
      limits.
    The .blocks file is valid input for PLINK 1.07's --hap command.  However,
    the --hap... family of flags has not been reimplemented in PLINK 1.9 due to
    poor phasing accuracy relative to other software; for now, we recommend
    using BEAGLE instead of PLINK for case/control haplotype association
    analysis.  (You can use "--recode beagle" to export data to BEAGLE 3.3.)
    We apologize for the inconvenience, and plan to develop variants of the
    --hap... flags which handle pre-phased data effectively.

  --distance [{square | square0 | triangle}] [{gz | bin | bin4}] ['ibs']
             ['1-ibs'] ['allele-ct'] ['flat-missing']
    Write a lower-triangular tab-delimited table of (weighted) genomic
    distances in allele count units to <output prefix>.dist, and a list of the
    corresponding sample IDs to <output prefix>.dist.id.  The first row of the
    .dist file contains a single <genome 1-genome 2> distance, the second row
    has the <genome 1-genome 3> and <genome 2-genome 3> distances in that
    order, etc.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * If the 'square' or 'square0' modifier is present, a square matrix is
      written instead; 'square0' fills the upper right triangle with zeroes.
    * If the 'gz' modifier is present, a compressed .dist.gz file is written
      instead of a plain text file.
    * If the 'bin' modifier is present, a binary (square) matrix of
      double-precision floating point values, suitable for loading from R, is
      instead written to <output prefix>.dist.bin.  ('bin4' specifies
      single-precision numbers instead.)  This can be combined with 'square0'
      if you still want the upper right zeroed out, or 'triangle' if you don't
      want to pad the upper right at all.
    * If the 'ibs' modifier is present, an identity-by-state matrix is written
      to <output prefix>.mibs.  '1-ibs' causes distances expressed as genomic
      proportions (i.e. 1 - IBS) to be written to <output prefix>.mdist.
      Combine with 'allele-ct' if you want to generate the usual .dist file as
      well.
    * By default, distance rescaling in the presence of missing genotype calls
      is sensitive to allele count distributions: if variant A contributes, on
      average, twice as much to other pairwise distances as variant B, a
      missing call at variant A will result in twice as large of a missingness
      correction.  To turn this off (because e.g. your missing calls are highly
      nonrandom), use the 'flat-missing' modifier.
    * The computation can be subdivided with --parallel.
  --distance-matrix
  --ibs-matrix
    These deprecated commands are equivalent to "--distance 1-ibs flat-missing
    square" and "--distance ibs flat-missing square", respectively, except that
    they generate space- instead of tab-delimited text matrices.

  --make-rel [{square | square0 | triangle}] [{gz | bin | bin4}]
             [{cov | ibc2 | ibc3}]
    Write a lower-triangular variance-standardized realized relationship matrix
    to <output prefix>.rel, and corresponding IDs to <output prefix>.rel.id.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * 'square', 'square0', 'triangle', 'gz', 'bin', and 'bin4' act as they do
      on --distance.
    * The 'cov' modifier removes the variance standardization step, causing a
      covariance matrix to be calculated instead.
    * By default, the diagonal elements in the relationship matrix are based on
      --ibc's Fhat1; use the 'ibc2' or 'ibc3' modifiers to base them on Fhat2
      or Fhat3 instead.
    * The computation can be subdivided with --parallel.
  --make-grm-gz ['no-gz'] [{cov | ibc2 | ibc3}]
  --make-grm-bin [{cov | ibc2 | ibc3}]
    --make-grm-gz writes the relationships in GCTA's original gzipped list
    format, which describes one pair per line, while --make-grm-bin writes them
    in GCTA 1.1+'s single-precision triangular binary format.  Note that these
    formats explicitly report the number of valid observations (where neither
    sample has a missing call) for each pair, which is useful input for some
    scripts.
    These computations can be subdivided with --parallel.

  --rel-cutoff [val]
    (alias: --grm-cutoff)
    Exclude one member of each pair of samples with relatedness greater than
    the given cutoff value (default 0.025).  If no later operation will cause
    the list of remaining samples to be written to disk, this will save it to
    <output prefix>.rel.id.
    Note that maximizing the remaining sample size is equivalent to the NP-hard
    maximum independent set problem, so we use a greedy algorithm instead of
    guaranteeing optimality.  (Use the --make-rel and --keep/--remove flags if
    you want to try to do better.)

  --ibs-test [permutation count]
  --groupdist [iters] [d]
    Given case/control phenotype data, these commands consider three subsets of
    the distance matrix: pairs of affected samples, affected-unaffected pairs,
    and pairs of unaffected samples.  Each of these subsets has a distribution
    of pairwise genomic distances; --ibs-test uses permutation to estimate
    p-values re: which types of pairs are most similar, while --groupdist
    focuses on the differences between the centers of these distributions and
    estimates standard errors via delete-d jackknife.

  --regress-distance [iters] [d]
    Linear regression of pairwise genomic distances on pairwise average
    phenotypes and vice versa, using delete-d jackknife for standard errors.  A
    scalar phenotype is required.
    * With less than two parameters, d is set to <number of people>^0.6 rounded
      down.  With no parameters, 100k iterations are run.
  --regress-rel [iters] [d]
    Linear regression of pairwise genomic relationships on pairwise average
    phenotypes, and vice versa.  Defaults for iters and d are the same as for
    --regress-distance.

  --genome ['gz'] ['rel-check'] ['full'] ['unbounded'] ['nudge']
    Generate an identity-by-descent report.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * The 'rel-check' modifier excludes pairs of samples with different FIDs
      from the final report.
    * 'full' adds raw pairwise comparison data to the report.
    * The P(IBD=0/1/2) estimator employed by this command sometimes yields
      numbers outside the range [0,1]; by default, these are clipped.  The
      'unbounded' modifier turns off this clipping.
    * Then, when PI_HAT^2 < P(IBD=2), 'nudge' adjusts the final P(IBD=0/1/2)
      estimates to a theoretically possible configuration.
    * The computation can be subdivided with --parallel.

  --homozyg [{group | group-verbose}] ['consensus-match'] ['extend']
            ['subtract-1-from-lengths']
  --homozyg-snp <min var count>
  --homozyg-kb <min length>
  --homozyg-density <max inverse density (kb/var)>
  --homozyg-gap <max internal gap kb length>
  --homozyg-het <max hets>
  --homozyg-window-snp <scanning window size>
  --homozyg-window-het <max hets in scanning window hit>
  --homozyg-window-missing <max missing calls in scanning window hit>
  --homozyg-window-threshold <min scanning window hit rate>
    These commands request a set of run-of-homozygosity reports, and allow you
    to customize how they are generated.
    * If you're satisfied with all the default settings described below, just
      use --homozyg with no modifiers.  Otherwise, --homozyg lets you change a
      few binary settings:
      * 'group[-verbose]' adds a report on pools of overlapping runs of
        homozygosity.  (Automatically set when --homozyg-match is present.)
      * With 'group[-verbose]', 'consensus-match' causes pairwise segmental
        matches to be called based on the variants in the pool's consensus
        segment, rather than the variants in the pairwise intersection.
      * Due to how the scanning window algorithm works, it is possible for a
        reported ROH to be adjacent to a few homozygous variants.  The 'extend'
        modifier causes them to be included in the reported ROH if that
        wouldn't cause a violation of the --homozyg-density bound.
      * By default, segment bp lengths are calculated as <end bp position> -
        <start bp position> + 1.  Therefore, reports normally differ slightly
        from PLINK 1.07, which does not add 1 at the end.  For testing
        purposes, you can use the 'subtract-1-from-lengths' modifier to apply
        the old formula.
    * By default, only runs of homozygosity containing at least 100 variants,
      and of total length >= 1000 kilobases, are noted.  You can change these
      minimums with --homozyg-snp and --homozyg-kb, respectively.
    * By default, a ROH must have at least one variant per 50 kb on average;
      change this bound with --homozyg-density.
    * By default, if two consecutive variants are more than 1000 kb apart, they
      cannot be in the same ROH; change this bound with --homozyg-gap.
    * By default, a ROH can contain an unlimited number of heterozygous calls;
      you can impose a limit with --homozyg-het.
    * By default, the scanning window contains 50 variants; change this with
      --homozyg-window-snp.
    * By default, a scanning window hit can contain at most 1 heterozygous
      call and 5 missing calls; change these limits with --homozyg-window-het
      and --homozyg-window-missing, respectively.
    * By default, for a variant to be eligible for inclusion in a ROH, the hit
      rate of all scanning windows containing the variant must be at least
      0.05; change this threshold with --homozyg-window-threshold.

  --cluster ['cc'] [{group-avg | old-tiebreaks}] ['missing'] ['only2']
    Cluster samples using a pairwise similarity statistic (normally IBS).
    * The 'cc' modifier forces every cluster to have at least one case and one
      control.
    * The 'group-avg' modifier causes clusters to be joined based on average
      instead of minimum pairwise similarity.
    * The 'missing' modifier causes clustering to be based on
      identity-by-missingness instead of identity-by-state, and writes a
      space-delimited identity-by-missingness matrix to disk.
    * The 'only2' modifier causes only a .cluster2 file (which is valid input
      for --within) to be written; otherwise 2 other files will be produced.
    * By default, IBS ties are not broken in the same manner as PLINK 1.07, so
      final cluster solutions tend to differ.  This is generally harmless.
      However, to simplify testing, you can use the 'old-tiebreaks' modifier to
      force emulation of the old algorithm.

  --pca [count] ['header'] ['tabs'] ['var-wts']
    Calculates a variance-standardized relationship matrix (use
    --make-rel/--make-grm-gz/--make-grm-bin to dump it), and extracts the top
    20 principal components.
    * It is usually best to perform this calculation on a marker set in
      approximate linkage equilibrium.
    * You can change the number of PCs by passing a numeric parameter.
    * The 'header' modifier adds a header line to the .eigenvec output file.
      (For compatibility with the GCTA flag of the same name, the default is no
      header line.)
    * The 'tabs' modifier causes the .eigenvec file(s) to be tab-delimited.
    * The 'var-wts' modifier requests an additional .eigenvec.var file with PCs
      expressed as variant weights instead of sample weights.

  --neighbour <n1> <n2>
    (alias: --neighbor)
    Report IBS distances from each sample to their n1th- to n2th-nearest
    neighbors, associated Z-scores, and the identities of those neighbors.
    Useful for outlier detection.

  --assoc ['perm' | 'mperm='<value>] ['perm-count'] [{fisher | fisher-midp}]
          ['counts'] ['set-test']
  --assoc ['perm' | 'mperm='<value>] ['perm-count'] ['qt-means'] ['lin']
          ['set-test']
  --model ['perm' | 'mperm='<value>] ['perm-count']
          [{fisher | fisher-midp | trend-only}] ['set-test']
          [{dom | rec | gen | trend}]
    Basic association analysis report.
    Given a case/control phenotype, --assoc performs a 1df chi-square allelic
    test, while --model performs 4 other tests as well (1df dominant gene
    action, 1df recessive gene action, 2df genotypic, Cochran-Armitage trend).
    * With 'fisher'/'fisher-midp', Fisher's exact test is used to generate
      p-values.  'fisher-midp' also applies Lancaster's mid-p adjustment.
    * 'perm' causes an adaptive permutation test to be performed.
    * 'mperm='<value> causes a max(T) permutation test with the specified
      number of replications to be performed.
    * 'perm-count' causes the permutation test report to include counts instead
      of frequencies.
    * 'counts' causes --assoc to report allele counts instead of frequencies.
    * 'set-test' tests the significance of variant sets.  Requires permutation;
      can be customized with --set-p/--set-r2/--set-max.
    * 'dom', 'rec', 'gen', and 'trend' force the corresponding test to be used
      as the basis for --model permutation.  (By default, the most significant
      result among the allelic, dominant, and recessive tests is used.)
    * 'trend-only' causes only the trend test to be performed.
    Given a quantitative phenotype, --assoc normally performs a Wald test.
    * In this case, the 'qt-means' modifier causes trait means and standard
      deviations stratified by genotype to be reported as well.
    * 'lin' causes the Lin statistic to be computed, and makes it the basis for
      multiple-testing corrections and permutation tests.
    Several other flags (most notably, --aperm) can be used to customize the
    permutation test.

  --mh ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
    (alias: --cmh)
  --bd ['perm' | 'perm-bd' | 'mperm='<value>] ['perm-count'] ['set-test']
  --mh2
  --homog
    Given a case/control phenotype and a set of clusters, --mh computes 2x2xK
    Cochran-Mantel-Haenszel statistics for each variant, while --bd also
    performs the Breslow-Day test for odds ratio homogeneity.  Permutation and
    variant set testing based on the CMH (default) or Breslow-Day (when
    'perm-bd' is present) statistic are supported.
    The following similar analyses are also available:
    * --mh2 swaps the roles of case/control status and cluster membership,
      performing a phenotype-stratified IxJxK Cochran-Mantel-Haenszel test on
      association between cluster assignments and genotypes.
    * --homog executes an alternative to the Breslow-Day test, based on
      partitioning of the chi-square statistic.

  --gxe [covariate index]
    Given both a quantitative phenotype and a case/control covariate loaded
    with --covar defining two groups, --gxe compares the regression coefficient
    derived from considering only members of one group to the regression
    coefficient derived from considering only members of the other.  By
    default, the first covariate in the --covar file defines the groups; use
    e.g. "--gxe 3" to base them on the third covariate instead.

  --linear ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
           [{genotypic | hethom | dominant | recessive | no-snp}]
           ['hide-covar'] [{sex | no-x-sex}] ['interaction'] ['beta']
           ['standard-beta'] ['intercept']
  --logistic ['perm' | 'mperm='<value>] ['perm-count'] ['set-test']
             [{genotypic | hethom | dominant | recessive | no-snp}]
             ['hide-covar'] [{sex | no-x-sex}] ['interaction'] ['beta']
             ['intercept']
    Multi-covariate association analysis on a quantitative (--linear) or
    case/control (--logistic) phenotype.  Normally used with --covar.
    * 'perm' normally causes an adaptive permutation test to be performed on
      the main effect, while 'mperm='<value> starts a max(T) permutation test.
    * 'perm-count' causes the permutation test report to include counts instead
      of frequencies.
    * 'set-test' tests the significance of variant sets.  Requires permutation;
      can be customized with --set-p/--set-r2/--set-max.
    * The 'genotypic' modifier adds an additive effect/dominance deviation 2df
      joint test (0/1/2 and 0/1/0 coding), while 'hethom' uses 0/0/1 and 0/1/0
      coding instead.  If permutation is also requested, these modifiers cause
      permutation to be based on the joint test.
    * 'dominant' and 'recessive' specify a model assuming full dominance or
      recessiveness, respectively, for the A1 allele.
    * 'no-snp' causes regression to be performed only on the phenotype and the
      covariates, without reference to genomic data.  If permutation is also
      requested, results are reported for all covariates.
    * 'hide-covar' removes covariate-specific lines from the report.
    * By default, sex (male = 1, female = 0) is automatically added as a
      covariate on X chromosome variants, and nowhere else.  The 'sex' modifier
      causes it to be added everywhere, while 'no-x-sex' excludes it.
    * 'interaction' adds genotype x covariate interactions to the model.  This
      cannot be used with the usual permutation tests; use --tests to define
      the permutation test statistic instead.
    * 'intercept' causes intercepts to be included in the main report.
    * For logistic regressions, the 'beta' modifier causes regression
      coefficients instead of odds ratios to be reported.
    * With --linear, the 'standard-beta' modifier standardizes the phenotype
      and all predictors to zero mean and unit variance before regression.

  --dosage <allele dosage file> ['noheader'] ['skip0='<i>] ['skip1='<j>]
           ['skip2='<k>] ['dose1'] ['format='<m>] ['Zout']
           [{occur | standard-beta}] ['sex'] ['case-control-freqs']
  --dosage <list file> list [{sepheader | noheader}] ['skip0='<i>]
           ['skip1='<j>] ['skip2='<k>] ['dose1'] ['format='<m>] ['Zout']
           [{occur | standard-beta}] ['sex'] ['case-control-freqs']
  --write-dosage
    Process (possibly gzipped) text files with variant-major allelic dosage
    data.  This cannot be used with a regular input fileset; instead, you must
    *only* specify a .fam and possibly a .map file, and you can't specify any
    other commands.
    * PLINK 2.0 will have first-class support for genotype probabilities.  An
      equivalent data import flag will be provided then, and --dosage will be
      retired.
    * By default, --dosage assumes that only one allelic dosage file should be
      loaded.  To specify multiple files,
      1. create a master list with one entry per line.  There are normally two
         supported formats for this list: just a filename per line, or variant
         batch numbers in the first column and filenames in the second.
      2. Provide the name of that list as the first --dosage parameter.
      3. Add the 'list' modifier.
    * By default, --dosage assumes the allelic dosage file(s) contain a header
      line, which has 'SNP' in column i+1, 'A1' in column i+j+2, 'A2' in column
      i+j+3, and sample FID/IIDs starting from column i+j+k+4.  (i/j/k are
      normally zero, but can be changed with 'skip0', 'skip1', and 'skip2'
      respectively.)  If such a header line is not present,
      * when all samples appear in the same order as they do in the .fam file,
        you can use the 'noheader' modifier.
      * Otherwise, use the 'sepheader' modifier, and append sample ID filenames
        to your 'list' file entries.
    * The 'format=' modifier lets you specify the number of values used to
      represent each dosage.  'format=1' normally indicates a single 0..2 A1
      expected count; 'dose1' modifies this to a 0..1 frequency.  'format=2'
      (the default) indicates a 0..1 homozygous A1 likelihood followed by a
      0..1 het likelihood, while 'format=3' indicates 0..1 hom A1, 0..1 het,
      0..1 hom A2.
    * 'Zout' causes the output file to be gzipped.
    * Normally, an association analysis is performed.  'standard-beta' and
      'sex' behave as they are supposed to with --linear/--logistic.
      'case-control-freqs' causes case and control allele frequencies to be
      reported separately.
    * There are three alternate modes which cause the association analysis to
      be skipped.
      * 'occur' requests a simple variant occurrence report.
      * --write-dosage causes a simple merged file matching the 'format'
        specification (not including 'dose1') to be generated.
      * --score applies a linear scoring system to the dosages.

  --lasso <h2 estimate> [min lambda] ['report-zeroes']
    Estimate variant effect sizes via LASSO regression.  You must provide an
    additive heritability estimate to calibrate the regression.
    Note that this method may require a very large sample size (e.g. hundreds
    of thousands) to be effective on complex polygenic traits.

  --test-missing ['perm' | 'mperm='<value>] ['perm-count'] ['midp']
    Check for association between missingness and case/control status, using
    Fisher's exact test.  (Het. haploids are treated as missing.)
    The 'midp' modifier causes Lancaster's mid-p adjustment to be applied.

  --make-perm-pheno <ct>
    Generate phenotype permutations and write them to disk, without invoking an
    association test.

  --tdt [{exact | exact-midp | poo}] ['perm' | 'mperm='<value>] ['perm-count']
        [{parentdt1 | parentdt2 | pat | mat}] ['set-test']
    Report transmission disequilibrium test statistics, given case/control
    phenotypes and pedigree information.
    * A Mendel error check is performed before the main tests; offending
      genotypes are treated as missing by this analysis.
    * By default, the basic TDT p-value is based on a chi-square test unless
      you request the exact binomial test with 'exact' or 'exact-midp'.
    * 'perm'/'mperm=' requests a family-based adaptive or max(T) permutation
      test.  By default, the permutation test statistic is the basic TDT
      p-value; 'parentdt1'/'parentdt2' cause parenTDT or combined test
      p-values, respectively, to be considered instead.
    * 'set-test' tests the significance of variant sets.  This cannot be used
      with exact tests for now.
    The 'poo' modifier causes a parent-of-origin analysis to be performed
    instead, with transmissions from heterozygous fathers and heterozygous
    mothers considered separately.
    * The parent-of-origin analysis does not currently support exact tests.
    * By default, the permutation test statistic is the absolute
      parent-of-origin test Z score; 'pat'/'mat' cause paternal or maternal TDT
      chi-square statistics, respectively, to be considered instead.

  --qfam ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-parents ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-between ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
  --qfam-total ['perm' | 'mperm='<value>] ['perm-count'] ['emp-se']
    QFAM family-based association test for quantitative traits.
    * A Mendel error check is performed before the main tests; offending
      genotypes are treated as missing by this analysis.
    * This procedure requires permutation.  'perm' and 'perm-count' have the
      usual meanings.  However, 'mperm='<value> just specifies a fixed number
      of permutations; the method does not support a proper max(T) test.
    * The 'emp-se' modifier adds BETA and EMP_SE (empirical standard error for
      beta) fields to the .perm output file.

  --annotate <PLINK report> ['attrib='<file>] ['ranges='<file>]
             ['filter='<file>] ['snps='<file>] [{NA | prune}] ['block']
             ['subset='<file>] ['minimal'] ['distance']
    Add annotations to a variant-based PLINK report.  This requires an
    annotation source:
    * 'attrib='<file> specifies a (possibly gzipped) attribute file.
    * 'ranges='<file> specifies a gene/range list file.
    (Both source types can be specified simultaneously.)  The following options
    are also supported:
    * 'filter='<file> causes only variants within one of the ranges in the file
      to be included in the new report.
    * 'snps='<file> causes only variants named in the file to be included in
      the new report.
    * The 'NA' modifier causes unannotated variants to have 'NA' instead of '.'
      in the new report's ANNOT column, while the 'prune' modifier excludes
      them entirely.
    * The 'block' modifier replaces the single ANNOT column with a 0/1-coded
      column for each possible annotation.
    * With 'ranges',
      * 'subset='<file> causes only intervals named in the subset file to be
        loaded from the ranges file.
      * interval annotations normally come with a parenthesized signed distance
        to the interval boundary (0 if the variant is located inside the
        interval; this is always true without --border).  They can be excluded
        with the 'minimal' modifier.
      * the 'distance' modifier adds 'DIST' and 'SGN' columns describing signed
        distance to the nearest interval.
    * When --pfilter is present, high p-values are filtered out.

  --clump <PLINK report filename(s)...>
    Process association analysis report(s) with 'SNP' and p-value columns,
    organizing results by LD-based clumps.  Multiple filenames can be separated
    by spaces or commas.

  --gene-report <PLINK report> <gene range file>
    Generate a gene-based report from a variant-based report.
    * When --pfilter is present, high p-values are filtered out.
    * When --extract (without 'range') is present, only variants named in the
      --extract file are considered.

  --meta-analysis <PLINK report filenames...>
  --meta-analysis <PLINK report filenames...> + [{logscale | qt}]
                  [{no-map | no-allele}] ['study'] ['report-all']
                  ['weighted-z']
    Perform a meta-analysis on several variant-based reports with 'SNP' and
    'SE' fields.
    * Normally, an 'OR' odds ratio field must also be present in each input
      file.  With 'logscale', 'BETA' log-odds values/regression coefficients
      are expected instead, but the generated report will still contain odds
      ratio estimates.  With 'qt', both input and output values are regression
      betas.
    * 'CHR', 'BP', and 'A1' fields are also normally required.  'no-map' causes
      them to all be ignored, while 'no-allele' causes just 'A1' to be ignored.
    * If 'A2' fields are present, and neither 'no-map' nor 'no-allele' was
      specified, A1/A2 allele flips are handled properly.  Otherwise, A1
      mismatches are thrown out.
    * 'study' causes study-specific effect estimates to be collated in the
      meta-analysis report.
    * 'report-all' causes variants present in only a single input file to be
      included in the meta-analysis report.
    * 'weighted-z' requests weighted Z-score-based p-values (as computed by the
      Abecasis Lab's METAL software) in addition to the usual inverse
      variance-based analysis.  This requires P and effective sample size
      fields.
    * When --extract (without 'range') is present, only variants named in the
      --extract file are considered.
    * Unless 'no-map' is specified, chromosome filters are also respected.

  --fast-epistasis [{boost | joint-effects | no-ueki}] ['case-only']
                   [{set-by-set | set-by-all}] ['nop']
  --epistasis [{set-by-set | set-by-all}]
    Scan for epistatic interactions.  --fast-epistasis inspects 3x3 joint
    genotype count tables and only applies to case/control phenotypes, while
    --epistasis performs linear or logistic regression.
    * By default, --fast-epistasis uses the PLINK 1.07 allele-based test.  Two
      newer tests are now supported: 'boost' invokes the likelihood ratio test
      introduced by Wan X et al. (2010) BOOST: A Fast Approach to Detecting
      Gene-Gene Interactions in Genome-wide Case-Control Studies, while
      'joint-effects' applies the joint effects test introduced in Ueki M,
      Cordell HJ (2012) Improved statistics for genome-wide interaction
      analysis.
    * The original --fast-epistasis test normally applies the variance and
      empty cell corrections suggested by Ueki and Cordell's paper.  To disable
      them, use the 'no-ueki' modifier.
    * 'case-only' requests a case-only instead of a case/control test.
    * By default, all pairs of variants across the entire genome are tested.
      To just test pairs of variants within a single set, add the 'set-by-set'
      modifier and load exactly one set with --set/--make-set; with exactly two
      sets loaded, all variants in one set are tested against all variants in
      the other.  'set-by-all' tests all variants in one set against the entire
      genome instead.
    * 'nop' strips p-values from the main report.
    * These computations can be subdivided with --parallel; however...
  --epistasis-summary-merge <common file prefix> <ct>
    When a --[fast-]epistasis job is subdivided with --parallel, the main
    report can be assembled at the end by applying Unix 'cat' in the usual
    manner, but the .summary.1, .summary.2, ... files may require a specialized
    merge.  --epistasis-summary-merge takes care of the latter.

  --twolocus <variant ID> <variant ID>
    Two-locus joint genotype count report.

  --score <filename> [i] [j] [k] ['header'] [{sum | no-sum}]
          [{no-mean-imputation | center}] ['include-cnt'] ['double-dosage']
    Apply a linear scoring system to each sample.
    The input file should have one line per scored variant.  Variant IDs are
    read from column #i, allele codes are read from column #j, and scores are
    read from column #k, where i defaults to 1, j defaults to i+1, and k
    defaults to j+1.
    * The 'header' modifier causes the first nonempty line of the input file to
      be ignored; otherwise, --score assumes there is no header line.
    * By default, final scores are averages of the valid per-variant scores.
      The 'sum' modifier causes sums to be reported instead.  (This cannot be
      used with 'no-mean-imputation'.  And for backward compatibility, 'sum' is
      automatically on with dosage data unless 'no-sum' is specified.)
    * By default, copies of the unnamed allele contribute zero to score, while
      missing genotypes contribute an amount proportional to the loaded (via
      --read-freq) or imputed allele frequency.  To throw out missing
      observations instead (decreasing the denominator in the final average
      when this happens), use the 'no-mean-imputation' modifier.
    * Alternatively, you can use the 'center' modifier to shift all scores to
      mean zero.
    * This command can be used with dosage data.  By default, the 'CNT' column
      is omitted from the output file in this case; use 'include-cnt' to keep
      it.  Also, note that scores are multiplied by 0..1 dosages, not 0..2
      diploid allele counts, unless the 'double-dosage' modifier is present.

  --R <R script file> ['debug']
    Connect to a Rserve (preferably version 1.7 or later) background process,
    and execute the Rplink function defined in the input file.  (Unless the
    'debug' modifier is present; in that case, the R commands that PLINK would
    have tried to execute are logged to a file.)

  --write-var-ranges <block ct>
    Divide the set of variants into equal-size blocks.  (Can be used with
    --snps to split a job across multiple machines.)

The following other flags are supported.  (Order of operations is described at
https://www.cog-genomics.org/plink/1.9/order .)
  --script <fname> : Include command-line options from file.
  --rerun [log]    : Rerun commands in log (default 'plink.log').
  --version        : Display only version number before exiting.
  --silent         : Suppress output to console.
  --gplink         : Reserved for interoperation with gPLINK.
  --missing-genotype <char> : Set missing genotype code (normally '0').
  --double-id          : Set both FIDs and IIDs to the VCF/BCF sample ID.
  --const-fid [ID]     : Set all FIDs to the given constant (default '0').
  --id-delim [d]       : Parse sample IDs as <FID><d><IID> (default delim '_').
  --vcf-idspace-to <c> : Convert spaces in sample IDs to the given character.
  --biallelic-only ['strict'] ['list'] : Skip VCF variants with 2+ ALT alleles.
  --vcf-min-qual <val>           : Skip VCF variants with low/missing QUAL.
  --vcf-filter [exception(s)...] : Skip variants which have FILTER failures.
  --vcf-require-gt               : Skip variants with no GT field.
  --vcf-min-gq <val>             : No-call a genotype when GQ is below the
                                   given threshold.
  --vcf-min-gp <val>             : No-call a genotype when 0-1 scaled GP is
                                   below the given threshold.
  --vcf-half-call <m>  : Specify how '0/.' and similar VCF GT values should be
                         handled.  The following four modes are supported:
                         * 'error'/'e' (default) errors out and reports line #.
                         * 'haploid'/'h' treats them as haploid calls.
                         * 'missing'/'m' treats them as missing.
                         * 'reference'/'r' treats the missing value as 0.
  --oxford-single-chr <chr nm> : Specify single-chromosome .gen file with
                                 ignorable first column.
  --oxford-pheno-name <col nm> : Import named phenotype from the .sample file.
  --hard-call-threshold <val>  : When an Oxford-format fileset is loaded, calls
  --hard-call-threshold random   with uncertainty level greater than 0.1 are
                                 normally treated as missing.  You can adjust
                                 this threshold by providing a numeric
                                 parameter, or randomize all calls with
                                 'random'.
  --missing-code [string list] : Comma-delimited list of missing phenotype
    (alias: --missing_code)      values for Oxford-format filesets (def. 'NA').
  --simulate-ncases <num>   : Set --simulate case count (default 1000).
  --simulate-ncontrols <n>  : Set --simulate control count (default 1000).
  --simulate-prevalence <p> : Set --simulate disease prevalence (default 0.01).
  --simulate-n <num>        : Set --simulate-qt sample count (default 1000).
  --simulate-label <prefix> : Set --simulate[-qt] FID/IID name prefix.
  --simulate-missing <freq> : Set --simulate[-qt] missing genotype frequency.
  --allow-extra-chr ['0']   : Permit unrecognized chromosome codes.  The '0'
    (alias: --aec)            modifier causes them to be treated as if they had
                              been set to zero.
  --chr-set <autosome ct> ['no-x'] ['no-y'] ['no-xy'] ['no-mt'] :
    Specify a nonhuman chromosome set.  The first parameter sets the number of
    diploid autosome pairs if positive, or haploid chromosomes if negative.
    Given diploid autosomes, the remaining modifiers indicate the absence of
    the named non-autosomal chromosomes.
  --cow/--dog/--horse/--mouse/--rice/--sheep : Shortcuts for those species.
  --autosome-num <value>    : Alias for "--chr-set <value> no-y no-xy no-mt".
  --cm-map <fname pattern> [chr] : Use SHAPEIT-format recombination maps to set
                                   centimorgan positions.  To process more than
                                   one chromosome, include a '@' in the first
                                   parameter where the chrom. number belongs,
                                   e.g. 'genetic_map_chr@_combined_b37.txt'.
  --zero-cms         : Zero out centimorgan positions.
  --allow-no-samples : Allow the input fileset to contain no samples.
  --allow-no-vars    : Allow the input fileset to contain no variants.
  --pheno <fname>  : Load phenotype data from the specified file, instead of
                     using the values in the main input fileset.
  --all-pheno      : For basic association tests, loop through all phenotypes
                     in --pheno file.
  --mpheno <n>     : Load phenotype from column (n+2) in --pheno file.
  --pheno-name <c> : If --pheno file has a header row, use column with the
                     given name.
  --pheno-merge    : When the main input fileset contains an phenotype value
                     for a sample, but the --pheno file does not, use the
                     original value instead of treating the phenotype as
                     missing.
  --missing-phenotype <v> : Set missing phenotype value (normally -9).
  --1                     : Expect case/control phenotypes to be coded as
                            0 = control, 1 = case, instead of the usual
                            0 = missing, 1 = control, 2 = case.  This also
                            forces phenotypes to be interpreted as case/ctrl.
  --make-pheno <fn> <val> : Define a new case/control phenotype.  If the val
                            parameter is '*', all samples listed in the given
                            file are cases, and everyone else is a control.
                            (Note that, in some shells, it is necessary to
                            surround the * with quotes.)
                            Otherwise, all samples with third column entry
                            equal to the val parameter are cases, and all other
                            samples mentioned in the file are controls.
  --tail-pheno <Lt> [Hbt] : Downcode a scalar phenotype to a case/control
                            phenotype.  All samples with phenotype values
                            greater than Hbt are cases, and all with values
                            less than or equal to Lt are controls.  If Hbt is
                            unspecified, it is equal to Lt; otherwise,
                            in-between phenotype values are set to missing.
  --covar <filename> ['keep-pheno-on-missing-cov'] : Specify covariate file.
  --covar-name <...>       : Specify covariate(s) in --covar file by name.
                             Separate multiple names with spaces or commas, and
                             use dashes to designate ranges.
  --covar-number <...>     : Specify covariate(s) in --covar file by index.
  --no-const-covar         : Exclude constant covariates.
  --allow-no-covars        : Allow no covariates to be loaded from --covar
                             file.
  --within <f> ['keep-NA'] : Specify initial cluster assignments.
  --mwithin <n>            : Load cluster assignments from column n+2.
  --family                 : Create a cluster for each family ID.
  --loop-assoc <f> ['keep-NA']  : Run specified case/control association
                                  commands once for each cluster in the file,
                                  using cluster membership as the phenotype.
  --set <filename>              : Load sets from a .set file.
  --set-names <name(s)...>      : Load only sets named on the command line.
                                  Use spaces to separate multiple names.
  --subset <filename>           : Load only sets named in the given text file.
  --set-collapse-all <set name> : Merge all sets.
  --complement-sets             : Invert all sets.  (Names gain 'C_' prefixes.)
  --make-set-complement-all <s> : --set-collapse-all + inversion.
  --make-set <filename>         : Define sets from a list of named bp ranges.
  --make-set-border <kbs>       : Stretch regions in --make-set file.
  --make-set-collapse-group     : Define sets from groups instead of sets in
                                  --make-set file.
  --keep <filename>       : Exclude all samples not named in the file.
  --remove <filename>     : Exclude all samples named in the file.
  --keep-fam <filename>   : Exclude all families not named in the file.
  --remove-fam <filename> : Exclude all families named in the file.
  --extract ['range'] <f> : Exclude all variants not named in the file.
  --exclude ['range'] <f> : Exclude all variants named in the file.
  --keep-clusters <filename>          : These can be used individually or in
  --keep-cluster-names <name(s)...>     combination to define a list of
                                        clusters to keep; all samples not in a
                                        cluster in that list are then excluded.
                                        Use spaces to separate cluster names
                                        for --keep-cluster-names.
  --remove-clusters <filename>        : Exclude all clusters named in the file.
  --remove-cluster-names <name(s)...> : Exclude the named clusters.
  --gene <sets...> : Exclude variants not in a set named on the command line.
                     (Separate multiple set names with spaces.)
  --gene-all       : Exclude variants which aren't a member of any set.  (PLINK
                     1.07 automatically did this under some circumstances.)
  --attrib <f> [att lst] : Given a file assigning attributes to variants, and a
  --attrib-indiv <f> [a]   comma-delimited list (with no whitespace) of
                           attribute names, remove variants/samples which are
                           either missing from the file or don't have any of
                           the listed attributes.  If some attribute names in
                           the list are preceded by '-', they are treated as
                           "negative match conditions" instead: variants with
                           at least one negative match attribute are removed.
                           The first character in the list cannot be a '-', due
                           to how command-line parsing works; add a comma in
                           front to get around this.
  --chr <chrs...>  : Exclude all variants not on the given chromosome(s).
                     Valid choices for humans are 0 (unplaced), 1-22, X, Y, XY,
                     and MT.  Separate multiple chromosomes with spaces and/or
                     commas, and use a dash (no adjacent spaces permitted) to
                     denote a range, e.g. "--chr 1-4, 22, xy".
  --not-chr <...>  : Reverse of --chr (exclude variants on listed chromosomes).
  --autosome       : Exclude all non-autosomal variants.
  --autosome-xy    : Exclude all non-autosomal variants, except those with
                     chromosome code XY (pseudo-autosomal region of X).
  --snps-only ['just-acgt'] : Exclude non-SNP variants.  By default, SNP = both
                              allele codes are single-character; 'just-acgt'
                              restricts codes to {A,C,G,T,a,c,g,t,<missing>}.
  --from <var ID>  : Use ID(s) to specify a variant range to load.  When used
  --to   <var ID>    together, both variants must be on the same chromosome.
  --snp  <var ID>  : Specify a single variant to load.
  --exclude-snp <> : Specify a single variant to exclude.
  --window  <kbs>  : With --snp or --exclude-snp, loads/excludes all variants
                     within half the specified kb distance of the named one.
  --from-bp <pos>  : Use physical position(s) to define a variant range to
  --to-bp   <pos>    load.  --from-kb/--to-kb/--from-mb/--to-mb allow decimal
  --from-kb <pos>    values.  You must also specify a single chromosome (using
  --to-kb   <pos>    e.g. --chr) when using these flags.
  --from-mb <pos>
  --to-mb   <pos>
  --snps <var IDs...>  : Use IDs to specify variant range(s) to load or
  --exclude-snps <...>   exclude.  E.g. "--snps rs1111-rs2222, rs3333, rs4444".
  --thin <p>       : Randomly remove variants, retaining each with prob. p.
  --thin-count <n> : Randomly remove variants until n of them remain.
  --bp-space <bps> : Remove variants so that each pair is no closer than the
                     given bp distance.  (Equivalent to VCFtools --thin.)
  --thin-indiv <p>         : Randomly remove samples, retaining with prob. p.
  --thin-indiv-count <n>   : Randomly remove samples until n of them remain.
  --filter <f> <val(s)...> : Exclude all samples without a 3rd column entry in
                             the given file matching one of the given
                             space-separated value(s).
  --mfilter <n>            : Match against (n+2)th column instead.
  --geno [val]     : Exclude variants with missing call frequencies greater
                     than a threshold (default 0.1).  (Note that the default
                     threshold is only applied if --geno is invoked without a
                     parameter; when --geno is not invoked, no per-variant
                     missing call frequency ceiling is enforced at all.  Other
                     inclusion/exclusion default thresholds work the same way.)
  --mind [val]     : Exclude samples with missing call frequencies greater than
                     a threshold (default 0.1).
  --oblig-missing <f1> <f2> : Specify blocks of missing genotype calls for
                              --geno/--mind to ignore.  The first file should
                              have variant IDs in the first column and block
                              IDs in the second, while the second file should
                              have FIDs in the first column, IIDs in the
                              second, and block IDs in the third.
  --prune             : Remove samples with missing phenotypes.
  --maf [freq]        : Exclude variants with minor allele frequency lower than
                        a threshold (default 0.01).
  --max-maf <freq>    : Exclude variants with MAF greater than the threshold.
  --mac <ct>          : Exclude variants with minor allele count lower than the
    (alias: --min-ac)   given threshold.
  --max-mac <ct>      : Exclude variants with minor allele count greater than
    (alias: --max-ac)   the given threshold.
  --maf-succ       : Rule of succession MAF estimation (used in EIGENSOFT).
                     Given j observations of one allele and k >= j observations
                     of the other, infer a MAF of (j+1) / (j+k+2), rather than
                     the default j / (j+k).
  --read-freq <fn> : Estimate MAFs and heterozygote frequencies from the given
                     --freq[x] report, instead of the input fileset.
  --hwe <p> ['midp'] ['include-nonctrl'] : Exclude variants with Hardy-Weinberg
                                           equilibrium exact test p-values
                                           below a threshold.
  --me <t> <v> ['var-first'] : Filter out trios and variants with Mendel error
                               rates exceeding the given thresholds.
  --me-exclude-one [ratio]   : Make --me exclude only one sample per trio.
  --qual-scores <f> [qcol] [IDcol] [skip] : Filter out variants with
                                            out-of-range quality scores.
                                            Default range is now [0, \infty ).
  --qual-threshold <min qual score>       : Set --qual-scores range floor.
  --qual-max-threshold <max qual score>   : Set --qual-scores range ceiling.
  --allow-no-sex   : Do not treat ambiguous-sex samples as having missing
                     phenotypes in analysis commands.  (Automatic /w --no-sex.)
  --must-have-sex  : Force ambiguous-sex phenotypes to missing on
                     --make-bed/--make-just-fam/--recode/--write-covar.
  --filter-cases       : Include only cases in the current analysis.
  --filter-controls    : Include only controls.
  --filter-males       : Include only males.
  --filter-females     : Include only females.
  --filter-founders    : Include only founders.
  --filter-nonfounders : Include only nonfounders.
  --nonfounders        : Include nonfounders in allele freq/HWE calculations.
  --make-founders ['require-2-missing'] ['first'] :
    Clear parental IDs for those with 1+ missing parent(s).
  --recode-allele <fn> : With --recode A/A-transpose/AD, count alleles named in
                         the file (otherwise A1 alleles are always counted).
  --output-chr <MT code> : Set chromosome coding scheme in output files by
                           providing the desired human mitochondrial code.
                           (Options are '26', 'M', 'MT', '0M', 'chr26', 'chrM',
                           and 'chrMT'.)
  --output-missing-genotype <ch> : Set the code used to represent missing
                                   genotypes in output files (normally the
                                   --missing-genotype value).
  --output-missing-phenotype <s> : Set the string used to represent missing
                                   phenotypes in output files (normally the
                                   --missing-phenotype value).
  --zero-cluster <f> : In combination with --within/--family, set blocks of
                       genotype calls to missing.  The input file should have
                       variant IDs in the first column and cluster IDs in the
                       second.  This must now be used with --make-bed and no
                       other output commands.
  --set-hh-missing       : Cause --make-bed and --recode to set heterozygous
                           haploid genotypes to missing.
  --set-mixed-mt-missing : Cause --make-bed and --recode to set mixed MT
                           genotypes to missing.
  --split-x <bp1> <bp2> ['no-fail']
  --split-x <build> ['no-fail'] :
    Changes chromosome code of all chrX variants with bp position <= bp1 or >=
    bp2 to XY.  The following build codes are supported as shorthand:
    * 'b36'/'hg18' = NCBI 36, 2709521/154584237
    * 'b37'/'hg19' = GRCh37, 2699520/154931044
    * 'b38'/'hg38' = GRCh38, 2781479/155701383
    By default, PLINK errors out when no variants would be affected by
    --split-x; the 'no-fail' modifier (useful in scripts) overrides this.
  --merge-x ['no-fail'] : Merge XY chromosome back with X.
  --set-me-missing      : Cause --make-bed to set Mendel errors to missing.
  --fill-missing-a2     : Cause --make-bed to replace all missing calls with
                          homozygous A2 calls.
  --set-missing-var-ids <t>   : Given a template string with a '@' where the
                                chromosome code should go and '#' where the bp
                                coordinate belongs, --set-missing-var-ids
                                assigns chromosome-and-bp-based IDs to unnamed
                                variants.
                                You may also use '$1' and '$2' to refer to
                                allele names in the template string, and in
                                fact this becomes essential when multiple
                                variants share the same coordinate.
  --new-id-max-allele-len <n> : Specify maximum number of leading characters
                                from allele names to include in new variant IDs
                                (default 23).
  --missing-var-code <string> : Change unnamed variant code (default '.').
  --update-chr  <f> [chrcol] [IDcol]  [skip] : Update variant chromosome codes.
  --update-cm   <f> [cmcol]  [IDcol]  [skip] : Update centimorgan positions.
  --update-map  <f> [bpcol]  [IDcol]  [skip] : Update variant bp positions.
  --update-name <f> [newcol] [oldcol] [skip] : Update variant IDs.
  --update-alleles <fname> : Update variant allele codes.
  --allele1234 ['multichar'] : Interpret/recode A/C/G/T alleles as 1/2/3/4.
                               With 'multichar', converts all A/C/G/Ts in
                               allele names to 1/2/3/4s.
  --alleleACGT ['multichar'] : Reverse of --allele1234.
  --update-ids <f>     : Update sample IDs.
  --update-parents <f> : Update parental IDs.
  --update-sex <f> [n] : Update sexes.  Sex (1 or M = male, 2 or F = female, 0
                         = missing) is loaded from column n+2 (default n is 1).
  --flip <filename>    : Flip alleles (A<->T, C<->G) for SNP IDs in the file.
  --flip-subset <fn>   : Only apply --flip to samples in --flip-subset file.
  --flip-scan-window <ct+1> : Set --flip-scan max variant ct dist. (def. 10).
  --flip-scan-window-kb <x> : Set --flip-scan max kb distance (default 1000).
  --flip-scan-threshold <x> : Set --flip-scan min correlation (default 0.5).
  --keep-allele-order  : Keep the allele order defined in the .bim file,
  --real-ref-alleles     instead of forcing A2 to be the major allele.
                         --real-ref-alleles also removes 'PR' from the INFO
                         values emitted by --recode vcf{,-fid,-iid}.
  --a1-allele <f> [a1col] [IDcol] [skip] : Force alleles in the file to A1.
  --a2-allele <filename> [a2col] [IDcol] [skip] :
    Force alleles in the file to A2.  ("--a2-allele <VCF filename> 4 3 '#'",
    which scrapes reference allele assignments from a VCF file, is especially
    useful.)
  --indiv-sort <m> [f] : Specify FID/IID sort order.  The following four modes
                         are supported:
                         * 'none'/'0' keeps samples in the order they were
                           loaded.  Default for non-merge operations.
                         * 'natural'/'n' invokes 'natural sort', e.g.
                           'id2' < 'ID3' < 'id10'.  Default when merging.
                         * 'ascii'/'a' sorts in ASCII order, e.g.
                           'ID3' < 'id10' < 'id2'.
                         * 'file'/'f' uses the order in the given file (named
                           in the second parameter).
                         For now, only --merge/--bmerge/--merge-list and
                         --make-bed/--make-just-fam respect this flag.
  --with-phenotype ['no-parents'] [{no-sex | female-2}] :
    Include more sample info in new .cov file.
  --dummy-coding [N] ['no-round'] : Split categorical variables (n categories,
                                    2 < n <= N, default N is 49) into n-1
                                    binary dummy variables when writing
                                    covariate file.
  --merge-mode <n>   : Adjust --[b]merge/--merge-list behavior based on a
                       numeric code.
                       1 (default) = ignore missing calls, otherwise difference
                                     -> missing
                       2 = only overwrite originally missing calls
                       3 = only overwrite when nonmissing in new file
                       4/5 = never overwrite and always overwrite, respectively
                       6 = report all mismatching calls without merging
                       7 = report mismatching nonmissing calls without merging
  --merge-equal-pos  : With --merge/--bmerge/--merge-list, merge variants with
                       different names but identical positions.  (Exception:
                       same-position chromosome code 0 variants aren't merged.)
  --mendel-duos      : Make Mendel error checks consider samples with only one
                       parent in the dataset.
  --mendel-multigen  : Make Mendel error checks consider (great-)grandparental
                       genotypes when parental genotype data is missing.
  --ld-window <ct+1> : Set --r/--r2 max variant ct pairwise distance (usu. 10).
  --ld-window-kb <x> : Set --r/--r2 max kb pairwise distance (usually 1000).
  --ld-window-cm <x> : Set --r/--r2 max centimorgan pairwise distance.
  --ld-window-r2 <x> : Set threshold for --r2 report inclusion (usually 0.2).
  --ld-snp <var ID>  : Set first variant in all --r/--r2 pairs.
  --ld-snps <vID...> : Restrict first --r/--r2 variant to the given ranges.
  --ld-snp-list <f>  : Restrict first --r/--r2 var. to those named in the file.
  --list-all         : Generate the 'all' mode report when using --show-tags in
                       file mode.
  --tag-kb <kbs>     : Set --show-tags max tag kb distance (default 250).
  --tag-r2 <val>     : Set --show-tags min tag r-squared (default 0.8)
  --tag-mode2        : Use two-column --show-tags (file mode) I/O format.
  --ld-xchr <code>   : Set chrX model for --indep[-pairwise], --r/--r2,
                       --flip-scan, and --show-tags.
                       1 (default) = males coded 0/1, females 0/1/2 (A1 dosage)
                       2 = males coded 0/2
                       3 = males coded 0/2, but females given double weighting
  --blocks-max-kb <kbs>      : Set --blocks maximum haploblock span (def. 200).
  --blocks-min-maf <cutoff>  : Adjust --blocks MAF minimum (default 0.05).
  --blocks-strong-lowci <x>  : Set --blocks "strong LD" CI thresholds (defaults
  --blocks-strong-highci <x>   0.70 and 0.98).
  --blocks-recomb-highci <x> : Set 'recombination' CI threshold (default 0.90).
  --blocks-inform-frac <x>   : Force haploblock <strong LD pairs>:<total
                               informative pairs> ratios to be larger than this
                               value (default 0.95).
  --distance-wts exp=<x>        : When computing genomic distances, assign each
                                  variant a weight of (2q(1-q))^{-x}, where q
                                  is the loaded or inferred MAF.
  --read-dists <dist file> [id file] : Load a triangular binary distance matrix
                                       instead of recalculating from scratch.
  --ppc-gap <val>    : Minimum number of base pairs, in thousands, between
                       informative pairs of markers used in --genome PPC test.
                       500 if unspecified.
  --min <cutoff>     : Specify minimum PI_HAT for inclusion in --genome report.
  --max <cutoff>     : Specify maximum PI_HAT for inclusion in --genome report.
  --homozyg-match <> : Set minimum concordance across jointly homozygous
                       variants for a pairwise allelic match to be declared.
  --pool-size <ct>   : Set minimum size of pools in "--homozyg group" report.
  --read-genome <fn> : Load --genome report for --cluster/--neighbour, instead
                       of recalculating IBS and PPC test p-values from scratch.
  --ppc <p-val>    : Specify minimum PPC test p-value within a cluster.
  --mc <max size>  : Specify maximum cluster size.
  --mcc <c1> <c2>  : Specify maximum case and control counts per cluster.
  --K <min count>  : Specify minimum cluster count.
  --ibm <val>      : Specify minimum identity-by-missingness.
  --match <f> [mv] : Use covariate values to restrict clustering.  Without
                     --match-type, two samples can only be in the same cluster
                     if all covariates match.  The optional second parameter
                     specifies a covariate value to treat as missing.
  --match-type <f> : Refine interpretation of --match file.  The --match-type
                     file is expected to be a single line with as many entries
                     as the --match file has covariates; '0' entries specify
                     "negative matches" (i.e. samples with equal covariate
                     values cannot be in the same cluster), '1' entries specify
                     "positive matches" (default), and '-1' causes the
                     corresponding covariate to be ignored.
  --qmatch <f> [m] : Force all members of a cluster to have similar
  --qt <fname>       quantitative covariate values.  The --qmatch file contains
                     the covariate values, while the --qt file is a list of
                     nonnegative tolerances (and '-1's marking covariates to
                     skip).
  --pca-cluster-names <...> : These can be used individually or in combination
  --pca-clusters <fname>      to define a list of clusters to use in the basic
                              --pca computation.  (--pca-cluster-names expects
                              a space-delimited sequence of cluster names,
                              while --pca-clusters expects a file with one
                              cluster name per line.)  All samples outside
                              those clusters will then be projected on to the
                              calculated PCs.
  --mds-plot <dims> ['by-cluster'] ['eigendecomp'] ['eigvals'] :
    Multidimensional scaling analysis.  Requires --cluster.
  --cell <thresh>  : Skip some --model tests when a contingency table entry is
                     smaller than the given threshold.
  --condition <var ID> [{dominant | recessive}] : Add one variant as a --linear
                                                  or --logistic covariate.
  --condition-list <f> [{dominant | recessive}] : Add variants named in the
                                                  file as --linear/--logistic
                                                  covariates.
  --parameters <...>  : Include only the given covariates/interactions in the
                        --linear/--logistic models, identified by a list of
                        1-based indices and/or ranges of them.
  --tests <all> [...] : Perform a (joint) test on the specified term(s) in the
                        --linear/--logistic model, identified by 1-based
                        indices and/or ranges of them.  If permutation was
                        requested, it is based on this test.
                        * Note that, when --parameters is also present, the
                          indices refer to the terms remaining AFTER pruning by
                          --parameters.
                        * You can use "--tests all" to include all terms.
  --vif <max VIF>     : Set VIF threshold for --linear multicollinearity check
                        (default 50).
  --xchr-model <code> : Set the X chromosome --linear/--logistic model.
                        0 = skip sex and haploid chromosomes
                        1 (default) = add sex as a covariate on X chromosome
                        2 = code male genotypes 0/2 instead of 0/1
                        3 = test for interaction between genotype and sex
  --lasso-select-covars [cov(s)...] : Subject some or all covariates to LASSO
                                      model selection.
  --adjust ['gc'] ['log10'] ['qq-plot'] : Report some multiple-testing
                                          corrections.
  --lambda <val>   : Set genomic control lambda for --adjust.
  --ci <size>      : Report confidence intervals for odds ratios.
  --pfilter <val>  : Filter out association test results with higher p-values.
  --aperm <min perms - 1> [max perms] [alpha] [beta] [init interval] [slope] :
    Set up to six parameters controlling adaptive permutation tests.
    * The first two control the minimum and maximum number of permutations that
      may be run for each variant; default values are 5 and 1000000.
    * The next two control the early termination condition.  A
      100% * (1 - beta/2T) confidence interval is calculated for each empirical
      p-value, where T is the total number of variants; whenever this
      confidence interval doesn't contain alpha, the variant is exempted from
      further permutation testing.  Default values are 0 and 1e-4.
    * The last two control when the early termination condition is checked.  If
      a check occurs at permutation #p, the next check occurs after
      <slope>p + <init interval> more permutations (rounded down).  Default
      initial interval is 1, and default slope is 0.001.
  --mperm-save     : Save best max(T) permutation test statistics.
  --mperm-save-all : Save all max(T) permutation test statistics.
  --set-p <p-val>        : Adjust set test significant variant p-value ceiling
                           (default 0.05).
  --set-r2 [v] ['write'] : Adjust set test significant variant pairwise r^2
                           ceiling (default 0.5).  'write' causes violating
                          pairs to be dumped to <output prefix>.ldset.
  --set-max <ct>         : Adjust set test maximum # of significant variants
                           considered per set (default 5).
  --set-test-lambda <v>  : Specify genomic control correction for set test.
  --border <kbs>            : Extend --annotate range intervals by given # kbs.
  --annotate-snp-field <nm> : Set --annotate variant ID field name.
  --clump-p1 <pval> : Set --clump index var. p-value ceiling (default 1e-4).
  --clump-p2 <pval> : Set --clump secondary p-value threshold (default 0.01).
  --clump-r2 <r^2>  : Set --clump r^2 threshold (default 0.5).
  --clump-kb <kbs>  : Set --clump kb radius (default 250).
  --clump-snp-field <n...>  : Set --clump variant ID field name (default
                              'SNP').  With multiple field names, earlier names
                              take precedence over later ones.
  --clump-field <name...>   : Set --clump p-value field name (default 'P').
  --clump-allow-overlap     : Let --clump non-index vars. join multiple clumps.
  --clump-verbose           : Request extended --clump report.
  --clump-annotate <hdr...> : Include named extra fields in --clump-verbose and
                              --clump-best reports.  (Field names can be
                              separated with spaces or commas.)
  --clump-range <filename>  : Report overlaps between clumps and regions.
  --clump-range-border <kb> : Stretch regions in --clump-range file.
  --clump-index-first       : Extract --clump index vars. from only first file.
  --clump-replicate         : Exclude clumps which contain secondary results
                              from only one file.
  --clump-best              : Report best proxy for each --clump index var.
  --meta-analysis-chr-field <n...> : Set --meta-analysis chromosome, variant
  --meta-analysis-snp-field <n...>   ID, position, A1/A2 allele, p-value,
  --meta-analysis-bp-field <n...>    standard error, and/or effective sample
  --meta-analysis-a1-field <n...>    size field names.
  --meta-analysis-a2-field <n...>    Defaults are 'CHR', 'SNP', 'BP', 'A1',
  --meta-analysis-p-field <n...>     'A2', 'P', 'SE', and 'NMISS',
  --meta-analysis-se-field <n...>    respectively.  When multiple parameters
  --meta-analysis-ess-field <n...>   are given to these flags, earlier names
                                     take precedence over later ones.
                                     Note that, if the numbers of cases and
                                     controls are unequal, effective sample
                                     size should be
                                       4 / (1/<# cases> + 1/<# controls>).
  --meta-analysis-report-dups      : When a variant appears multiple times in
                                     in the same file, report that.
  --gene-list-border <kbs>   : Extend --gene-report regions by given # of kbs.
  --gene-subset <filename>   : Specify gene name subset for --gene-report.
  --gene-report-snp-field <> : Set --gene-report variant ID field name (default
                               'SNP').  Only relevant with --extract.
  --gap <kbs>      : Set "--fast-epistasis case-only" min. gap (default 1000).
  --epi1 <p-value> : Set --[fast-]epistasis reporting threshold (default
                     5e-6 for 'boost', 1e-4 otherwise).
  --epi2 <p-value> : Set threshold for contributing to SIG_E count (def. 0.01).
  --je-cellmin <n> : Set required number of observations per 3x3x2 contingency
                     table cell for joint-effects test (default 5).
  --q-score-range <range file> <data file> [i] [j] ['header'] :
    Apply --score to subset(s) of variants in the primary score list based
    on e.g. p-value ranges.
    * The first file should have range labels in the first column, p-value
      lower bounds in the second column, and upper bounds in the third column.
      Lines with too few entries, or nonnumeric values in the second or third
      column, are ignored.
    * The second file should contain a variant ID and a p-value on each
      nonempty line (except possibly the first).  Variant IDs are read from
      column #i and p-values are read from column #j, where i defaults to 1 and
      j defaults to i+1.  The 'header' modifier causes the first nonempty line
      of this file to be skipped.
  --R-port <port #>  : Connect to Rserve on a port other than 6311.
  --R-host <host>    : Connect to Rserve host.
  --R-socket <sock>  : Connect to Rserve socket.
  --parallel <k> <n> : Divide the output matrix into n pieces, and only compute
                       the kth piece.  The primary output file will have the
                       piece number included in its name, e.g. plink.rel.13 or
                       plink.rel.13.gz if k is 13.  Concatenating these files
                       in order will yield the full matrix of interest.  (Yes,
                       this can be done before unzipping.)
                       N.B. This generally cannot be used to directly write a
                       symmetric square matrix.  Choose square0 or triangle
                       shape instead, and postprocess as necessary.
  --memory <val>     : Set size, in MB, of initial workspace malloc attempt.
                       (Practically mandatory when using GNU parallel.)
  --threads <val>    : Set maximum number of concurrent threads.
                       This has one known limitation: some BLAS/LAPACK linear
                       algebra operations are multithreaded in a way that PLINK
                       cannot control.  If this is problematic, you should
                       recompile against single-threaded BLAS/LAPACK.
  --d <char>         : Change variant/covariate range delimiter (normally '-').
  --seed <val...>    : Set random number seed(s).  Each value must be an
                       integer between 0 and 4294967295 inclusive.
  --perm-batch-size <val> : Set number of permutations per batch for some
                            permutation tests.
  --native           : Allow Intel MKL to use processor-dependent code paths.
  --output-min-p <p> : Specify minimum p-value to write to reports.
  --debug            : Use slower, more crash-resistant logging method.

Primary methods paper:
Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ (2015)
Second-generation PLINK: rising to the challenge of larger and richer datasets.
GigaScience, 4.

For further documentation and support, consult the main webpage
(https://www.cog-genomics.org/plink/1.9 ) and/or the mailing list
(https://groups.google.com/d/forum/plink2-users ).