Changes in feeding habits promoted the differentiation of the composition and function of gut microbiotas between domestic dogs (Canis lupus familiaris) and gray wolves (Canis lupus)

Wolves (Canis lupus) and their domesticated and close relatives, dogs (Canis lupus familiaris), have great differences in their diets and living environments. To the best of our knowledge, the fundamental question of how the abundance and function of the gut microbiota of domestic dogs evolved to adapt to the changes in host feeding habits has yet to be addressed. In this study, our comparative analyses of gut metagenomes showed that the abundance of gut microbiota between the two species have some significant differences. Furthermore, a number of taxa observed in higher numbers in domestic dogs are related to carbohydrate metabolism, which may be because that there were more complicated polysaccharides in dogs diets than that in wolves diets. A significant difference in the abundance of genes encoding glycosyltransferase family 34 (GT34), carbohydrate-binding module family 25 (CBM25), and glycoside hydrolase family 13 (GH13) between the gut microbiota metagenomes of domestic dogs and gray wolves also supported this observation. Furthermore, the domestic dog gut microbiota has greater valine, leucine and isoleucine biosynthesis and nitrogen metabolism. This result showed that compared with wolves, the domestic dog diet contains a smaller amount of animal protein, which is consistent with the dietary composition of wolves and dogs. Our results indicate that the function and abundance of gut microbiota of domestic dogs has been adapted to domestication, which is of great significance for the ability of domestic dogs to adapt to changes in food composition.


Introduction
The domestication of animals and plants has had an important impact on the progress of human civilization (Diamond 2002). The discovery and domestication of plants and animals transformed human civilization from the primitive hunting lifestyle to the traditional pattern of agricultural farming, greatly promoting the development of human civilization (Zeder 2015;Zohary and Hopf 2000). The selective breeding of animals and plants during the process of domestication has significantly altered their genes and traits compared to those of wild populations (Bruford et al. 2003;Giuffra et al. 2000;Rubin et al. 2010). Canines are carnivores whose wide distribution is closely related to humans. Some studies have shown that dogs, the first domesticated vertebrate, were domesticated by humans from wolves in East Asia approximately 15,000 years ago (Savolainen et al. 2002). However, recent findings suggest that the domestication of dogs may have occurred earlier, during the late Pleistocene in modern day Belgium (Germonpré et al. 2009), the Czech Republic (Germonpré et al. 2012), and southwestern Siberia (Ovodov et al. 2011). Therefore, the timing and location of dog domestication is still unclear (Larson et al. 2012;Perri 2016). In dogs, genes for digestion, metabolism, nerves and tumors show a coevolutionary trend with humans (Wang et al. 2013), and the increased ability of dogs to digest starch in food relative to wolves has been demonstrated at the genome level (Axelsson et al. 2013). A mitochondrial genomes study for present-day dogs and wolves and fossil canids suggested that wolves may come into contact with European hunters and share the food from hunters, thus, Europeans domesticated wolves and turned them into dogs (Thalmann et al. 2013). Bosch et al. (2015) suggested that as real carnivores, the main food source of wild wolves were mammals including ungulates and non-ungulates. Although wild wolves sometimes choose to eat some vegetations, the amount of vegetal matters they eat is negligible compared to the amount of meat they eat. Gao et al. (1996) and Yan et al. (2006) found that in China, the main food sources of wild wolves lived in Inner Mongolia were ungulates, including a considerable number of livestock. Unlike wild wolves, present-day dogs have a more diverse diet, mainly including starch food, fat and protein (Lasater and Mooers 1993;Rowe et al. 1997). However, although the whole genome characteristics of wolves and domestic dogs has been well studied, the differences in the structure and function of digestive tract flora between wolves and domestic dogs remains open to question.
There are 10 13 -10 14 gut microbiota living in the human digestive tract, which are closely associated with host digestive physiology, and these microbes form a large and complex ecosystem (Eckburg et al. 2005;Gill et al. 2006). The gut microbiota is important to mammals, affecting many aspects of animal physiology and health, such as digestion and nutrient absorption, energy supply, fat metabolism, and immune regulation (Guarner and Malagelada 2003). A comprehensive understanding of the composition and structure of the intestinal microbiota is the first step to reveal the interactions between intestinal microorganisms and animal hosts. In 2015, a study on the gut microbiota of Hadza hunter-gatherers indicated that the complex polysaccharides in the diet of this group made their gut microbiota adapted to metabolize a broad range of carbohydrates (Rampelli et al. 2015). Another study also confirmed that the variety of polysaccharides in food has an effect on the composition and function of gut microbiota (Hehemann et al. 2010). In addition, there are a number of phenomena of gut microbiota and host adaptive evolution in wild animals. A previous study revealed that the gut microbiota of the giant panda helps the host to digest cellulose, since the giant panda lacks the gene encoding cellulase (Zhu et al. 2011). Although creosote toxins in plants may be harmful to many herbivores, the function and structure of gut microbiota of the desert woodrat has changed to help the host adapt to the toxins in such plants and allow it to use such plants as a food source (Kohl et al. 2014).
Metagenomics was the first proposed method to study the whole genome information contained in a microbial community (Handelsman et al. 1998) and was defined as a genomics technology to study the microbial community without culturing individual microorganisms (Chen and Pachter 2005). In this study, we sequenced metagenomic DNA using an Illumina HiSeq platform (Novo gene) to identify the genes and genomes from the intestinal microbiota in feces from dogs and wolves. The main goal of this study was to compare the composition and function of gut microbiotas between dogs and wolves and screen the gut microbiota and functions of dogs that are significant different from wolves, which is meaningful for understanding that domestic dogs can eat a much more complex variety of foods compared to wolves.

Sample collection
All fecal samples were collected from four adult wolves and three adult dogs living in the Dalai Lake National Nature Reserve in the Inner Mongolia region of northern China during November and December 2016. These samples were divided into two groups: the four wolf fecal samples were named CL1.1-CL1.4, and the three dog fecal samples were named CL2.1-CL2.3 (Table 1). The wolves were enclosure in iron fences and have plenty of room to move. The diets of the four wolves included raw chicken, sheepskin, and lamb to simulate the food composition of wild wolves as much as possible, and the diets of the three dogs included leftover foods, including vegetables, steamed buns, noodles, fruits and meats. The selected fecal samples were collected from the surface of fresh snow after no more than 2 h to ensure the samples were clean and unmistakable. The fecal samples were collected at an ambient temperature of − 20 °C and were stored at − 80 °C in an Ultra-Low Temperature Freezer Dalai Lake National Nature Reserve before DNA extraction to guarantee the integrity of the DNA from intestinal flora.

DNA extraction
Genomic DNA was extracted from feces using a phenolchloroform extraction method following the guidelines for subsequent analysis (Köchl et al. 2005). The potential contamination and degree of degradation of the extracted DNA was monitored by 1% AGE (agarose gel electrophoresis). Next, we used a NanoPhotometer ® spectrophotometer (IMPLEN, CA, USA) to assess the purity of the DNA, and a Qubit ® dsDNA Assay kit was used to measure the concentration of DNA with a Qubit ® 2.0 fluorometer (Life Technologies, CA, USA). Only DNA samples with concentrations above 1 µg were used for subsequent library construction.

Metagenomic sequencing
The qualifying DNA samples were broken into approximately 350 bp fragments using ultrasonic waves (Focused-ultrasonication with AFA Technology, Covaris, UK). The fragments were end-polished, A-tailed, purified, ligated with full-length adaptors and further PCR amplificated to generate libraries. After construction, the libraries were diluted to 2 ng/µl using a Qubit 2.0 (Invitrogen, USA). Next, the fragment sizes in the library were determined using an Agilent 2100 Bioanalyzer (Agilent, USA), and the effective concentration (> 3 nM) of the library was accurately quantified using Real-time Quantitative PCR Detecting System (Real-time q-PCR) (Panaro et al. 2000;Smith and Osborn 2009). The index-coded sample clustering was conducted with a cBot cluster generation system following the manufacturer's instructions. An Illumina HiSeq xten platform (Illumina, USA) was used to sequence the libraries and generated paired-end reads after cluster generation. The data set supporting the results of this article is available in the Sequencing Read Archive (SRA) database, accession numbers SRP179020.

Data analysis
After sequencing, more than 6.4 Gb of sequences was generated for each DNA sample. The Short Oligonucleotide Analysis Package aligner (SOAP aligner) was used to remove reads with low quality scores, large numbers of "N" bases, or those contaminated with adapters to obtain clean reads (Gu et al. 2013). The effective rates of clean data of samples are greater than 99.4%. Short Oligonucleotide Analysis Package denovo (SOAP denovo) was used to finish the assembly analysis of clean data (Luo et al. 2012). The length distribution of scarftigs were acceptable for subsequent analysis after treatment by SOAP denovo. The open reading frame (ORF) prediction of scarftigs and subsequent results (gene catalogue) were performed using MetaGeneMark and CD-HIT (Fu et al. 2012;Karlsson et al. 2013;Li et al. 2014;Mende et al. 2012;Oh et al. 2014). Basic statistics were performed based on the abundances of various genes in the samples in the gene catalogue (Table 2). We compared unigenes with the sequences of bacteria, fungi, archaea and viruses, which were extracted from NR database of NCBI (Version: 2016-11-05) using DIA-MOND (Buchfink et al. 2015). Correlation analysis to assess correlations between samples, clustering analysis to assess the similarities of bacterial taxa between samples at the phylum and genus levels, and principal component analysis (PCA) and non-metric multidimensional scaling (NMDS) to assess the significantly different bacterial taxa and gene functions between species were all calculated with R software. ANOSIM analysis was used to test whether the differences between groups were significantly greater than those within the group to assess the importance of the grouping (Clarke 1993), which was calculated using QIIME (Version: 1.7.0). Metastats was used to identify the species with significant differences (White et al. 2009). In addition, DIAMOND was used to compare the unigenes with the Kyoto Encyclopedia of Genes and Genomes (KEGG) and Carbohydrate-Active Enzymes Database, Version: 2014.11.25 (CAZy) databases to calculate the relative abundances of the gut microbiota at different functional levels. The basic steps of functional annotations are as follows: (1) DIAMOND software was used to compare unigenes with various functional databases (BLASTp, e value is ≤ 1e−5) (Feng et al. 2015); (2) screening of comparisons of results and the results of the highest score comparison (one HSP > 60 bits) were selected for subsequent analysis (Li et al. 2014); (3) relative abundances at different functional levels were compared based on the results of the comparison, where the relative abundance of each functional level is equal to the sum of the relative abundance of the genes in the function hierarchy (Karlsson et al. 2013); (4) the number of genes in each sample for each classification level table were based on the results of the functional annotation and gene abundance tables; and (5) the dimension reduction analysis and sample cluster analysis of Bray-Curtis distances were based on the abundance of functions.

Results
A total of 43 Gb high-quality sequences were obtained from the samples of seven individual animals. After quality control and filtering, the total amount of clean data also remained at about 43 Gb, with an effective percentage of more than 99.4% (Table 3).

Correlation analysis of fecal microbiomes
The result of the heatmap of correlation coefficients revealed that the correlation within groups is stronger than that between groups (Fig. 1). The ANOSIM analysis based on species richness and the cluster analysis based on Bray-Curtis distance reach the same conclusion (Fig. 1). These results indicated that the grouping is meaningful and there was a difference in bacterial community structure between the two species.

Comparison of significantly different microbial groups
The top 10 taxa with the maximum relative abundances were selected for each sample (Fig. 2). The results revealed that the top five phyla present in the microbiomes both of wolves and dogs were as follows: Bacteroidetes, Fusobacteria, Firmicutes, Proteobacteria, and Actinobacteria. The top five genera in the microbiomes both of wolves and dogs were Bacteroides, Fusobacterium, Prevotella, Clostridium, and Blautia.
Metastat was used to study the significant differences in species abundance between the two groups at different taxonomic levels. The species with significant differences were selected according to the q-values, and the abundance distributions of the different species between the groups are shown in (Fig. 3). At the family level, the bacterial taxa with significantly higher abundances (q < 0.05) in dogs than in wolves were Ruminococcaceae, Spirochaetaceae, Lactobacillaceae, and Desulfuromonadaceae. The difference of Lactobacillaceae was observed to be extremely significant in dogs and wolves (q < 0.01). At the genus level, the bacterial taxa that had significantly higher abundances in dogs than that in wolves were Streptobacillus, Desulfuromusa, Ruegeria, Lactobacillus, Carnobacterium, Faecalibacterium and Treponema.
According to the annotated KEGG results, no significant differences in functional categories were observed between dogs and wolves at the first and second levels, but differences were observed at the third level (p < 0.05). The functional categories with significantly higher abundances in domestic dogs than wolves at the third level were valine, leucine and isoleucine biosynthesis (ko00290) and nitrogen metabolism (ko00910). We also observed that the domestic dog metagenome has a higher enrichment for GT34, CBM25, and GH13 (p < 0.01) compared to wolves. It is worth noting that we did not observe carbohydrate-binding module 25 in wolves. At the third level, six enzymes belonging  (Fig. 7).

Discussion
By comparing the gut microbiota of wolves and domestic dogs, we observed that the differences in microbial species and genes were related to many functions, such as starch and cellulose metabolism. The top five most abundant phyla in domestic dogs and wolves were Bacteroidetes, Fusobacteria, Firmicutes, Proteobacteria, and Actinobacteria, similar to the results of (Wu et al. 2017).

Fig. 1
Correlation coefficient between samples, the ANOSIM analysis based on species, and the cluster analysis based on Bray-Curtis distance. a Correlation coefficient. The different colors represent the different Spearman correlation coefficient, the deeper the color is, the greater the absolute value of the correlation between samples; b ANOSIM analysis at the phylum level. The x-axis for grouping information, y-axis for distance information, R-value is between (− 1, 1) and R-value is greater than 0, indicating there are significant differences between groups; c the cluster analysis at the family level; and d the cluster analysis at the genus level At other taxonomic levels, such as family and genus, no significant differences were observed in the top five taxa between dogs and wolves (q < 0.05). The microbial taxa with significantly higher abundances in domestic dogs significantly correlated with their complex polysaccharide diet, including the families Ruminococcaceae (Brown and Brown 1966;Huws et al. 2011) and Desulfuromonadaceae (Greene 2014), which are related to cellulose digestion, and Lactobacillaceae, members of which are related to the fermentation of glucose, as well as the genera Streptobacillus (Gharbia and Edwards 2015), Desulfuromusa (Werner and Kai 2015), Lactobacillus (Zaunmüller et al. 2006), Carnobacterium (Leisner et al. 2007), and Faecalibacterium (Lopezsiles et al. 2012).
These results indicate that the composition and function of gut microbiota in domesticated dogs may have been influenced by human food. Although PCA and NMDS analyses based on the comments of carbohydrate-active enzymes and metabolic pathway were able to divide dogs and wolves into two populations, the annotation results for the top 6 functions of gut microbiota based for the two databases  showed no significant differences (Fig. 4). Therefore, we used a metastat analysis to evaluate all the functions of the annotations, including non-essential functions. According to the metastat analysis, the significant differences in bacterial functions were identified based on both of the databases.
The KEGG database analysis showed that the gut microbiota of domestic dogs was characterized by an enrichment of genes involved in carbohydrate metabolism (CL1, 13.5 ± 2.5; CL2, 15.3 ± 1.6), global and overview maps (CL1, 13.2 ± 2.7; CL2, 15.6 ± 1.8), and amino acid metabolism (CL1, 13.3 ± 2.6; CL2, 15.5 ± 1.9). The higher gene percentages in these classifications suggested a complex diet intake and an ability for polysaccharide absorption in domestic dogs. In particular, the significantly higher abundance of genes involved in valine, leucine and isoleucine biosynthesis (ko00290) and nitrogen metabolism (ko00910) in domestic dogs also revealed that the gut microbiota of domestic dogs is more active in branched-chain amino acid (BCAA) metabolism. Some previous studies of gut microbiota also showed that the significant differences in these two pathways corresponded to a low animal protein intake (Rampelli et al. 2015;Schnorr et al. 2014). This result is in line with the In addition, we observed that the annotated pathway Biosynthesis of amino acids (ID: map01230) was significantly different between wolves and domestic dogs (Fig. 6). In this, metabolic pathway l-cysteine is produced from l-cystathionine by cystathionine gamma-lyase (EC:4.4.1.1). This amino acid is widely present in most high-protein foods, including animal and plant sources. Fig. 6 The annotated pathway of amino acids biosynthesis Cysteine is found in many meat products (including pork, sausage meat, chicken, turkey, duck and lunch meat), eggs and dairy. In plant-based foods, red peppers, garlic, onions, broccoli, brussels sprout, oats, granola, wheat germ and sprouted lentils have a high cysteine content. Cysteine plays an important role in animals, and the lack of this amino acid can lead to several diseases (Ames 1999;Chévezbarrios et al. 2000;Goodman et al. 2000;Novelli et al. 2009;Silva et al. 2013). The gut microbiota of domestic dogs can synthesize cysteine more efficiently than wolves by producing cysteine using cystathionine gamma-lyase directly. We speculated that wolves ingest sufficient amounts of cysteine because the foods wolves eat are rich in cysteine. The diet of dogs has more starch and cellulose than cysteine. To avoid diseases caused by a lack of cysteine, the gut microbiota needs to supply a large amount of cysteine to their hosts. Thus, they possess a more efficient way of synthesizing cysteine. This result may be due to the reduction of cysteine intake by dogs from food.
According to the results of this study, we determined that most of the bacterial taxa at the family and genus level that have a more significant presence in dogs than in wolves are related to cellulose and starch digestion, and the most significantly different enzymes were associated with carbohydrates, especially amylose, sucrose, and maltose. Therefore, we believe that the significant differences in these bacteria and enzymes have a direct relationship with the changes in the diets of dogs resulting from the domestication of dogs by humans.
In conclusion, we explored the different compositions and functions of the gut microbiota of wolves and domestic dogs using metagenomic sequencing analysis. The findings of our study provided a unique insight into the different functions and compositions of the gut microbiomes between dogs and wolves and increases our understanding of the bacterial ecosystems in canids. However, the sample size in this study was small because of the relatively small number of wolves in China and it is difficult to get proper samples. Further researches are needed to test and verify our findings and in subsequent studies we will continue to enlarge the sample size to obtain more convincing results.