J. Dev. Biol. 2022, 10, x FOR PEER REVIEW 4 of 21 Review Seeking sense in the Hox gene cluster Stephen J. Gaunt 1 Citation: Gaunt, S. J. Seeking sense in the Hox gene cluster. J. Dev. Biol. 2022, 10, x. https://doi.org/10.3390/xxxxx Academic Editor: Firstname Lastname Received: date Accepted: date Published: date Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. Copyright: © 2022 by the authors. Submitted for possible open access publication under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). 1 Department of Zoology, University of Cambridge, Downing Street, Cambridge, CB2 3EJ, UK.; sg397@cam.ac.uk Abstract: The Hox gene cluster, responsible for patterning of the head-tail axis, is an ancestral feature of all bilaterally symmetrical animals (the Bilateria) that remains intact in a wide range of species. We can say that the Hox cluster evolved successfully only once since it is commonly the same in all groups, with labial-like genes at one end of the cluster expressed in the anterior embryo, and Abd-B-like genes at the other end of the cluster expressed posteriorly. This review attempts to make sense of the Hox gene cluster, and to address the following questions. How did the Hox cluster form in the protostome/deuterostome last common ancestor, and why was this with a particular head-tail polarity? Why is gene clustering usually maintained? Why is there collinearity between order of genes along the cluster and positions of their expressions along the embryo? Why do the Hox gene expression domains overlap along the embryo? Why have vertebrates duplicated the Hox cluster? Why do Hox gene knockouts typically result in anterior homeotic transformations? How do animals adapt their Hox clusters to evolve new structural patterns along the head-tail axis? Keywords: Hox cluster; collinearity; evolution; axial morphology; gene knockout; Bilateria; Cnidaria 1. Introduction An unexpected discovery in the 1980s was that both arthropods (Drosophila) and vertebrates (mice) utilize conserved clusters of Hox genes in order to specify pattern formation along their head-tail axes. The widely held conclusion is that the common ancestor to these two groups (the protostome-deuterostome last common ancestor, or P-DLCA) already possessed a cluster of seven, or more, Hox genes which it used, most likely as in its descendants today, for specification of distinct body parts along the anteror-to-posterior (A-P) axis (Figure 1) [1-3]. Figure 1. Homologous Hox gene clusters in Drosophila, mouse/human and, by inference, their common ancestor. The ancestor (the protostome-deuterostome last common ancestor, P-DLCA) may have had more than the seven genes shown here [4]. Genes that share the same numbers and shading intensities are most recently related by descent. Hox-derived genes in Drosophila which no longer function as true Hox genes are named in grey text. Arrows indicate directions of transcription (presumed for ancestor). ANT-C, Antennapedia complex; BX-C, Bithorax complex. The Hox cluster is an ancestral feature of all bilaterally symmetrical animals (Bilateria) retained in many, though not all, species (Section 2.2). We can say that it evolved successfully only once since the cluster is the same in all groups, with labial-like genes at one end of the cluster expressed in the anterior embryo, and Abd-B-like genes at the other end of the cluster expressed posteriorly [5,6]. Although now disrupted in some descendants, the Hox gene cluster in many other descendants, both protostome and deuterostome, is still conserved to a large extent in its inferred ancestral form [5,6]. The entire cluster has undergone duplications in vertebrates (Figure 1; Section 2.6) though it remains unduplicated in invertebrate deuterostomes [5,6]. From his pioneering analyses of Drosophila developmental mutants Ed Lewis [7] was the first to propose the following (Figure 2A): that the clustered set of genes now known as Hox genes are expressed in a series of partially overlapping domains along the length of the embryo; that the order of the genes along the chromosome corresponds with the order of their expression domains along the head-tail axis (the spatial collinearity rule); and that the blend of Hox genes active in each segment or A-P domain of the body is responsible for specification of its structure. Subsequent work showed that these predictions hold true for many or most bilaterally symmetrical animals (Sections 2.4, 2.7, 2.8). Figure 2. Hox expression patterns which do, or do not, conform to Lewis’s collinearity model. (A) Conforming to Lewis’s model for Drosophila [7], and as now known to apply in many different animals [5,6], Hox genes are expressed in a series of partially overlapping domains along the body with the order of genes along the cluster being collinear with the order of their expression domains along the head-tail axis. This is shown in Figure 2A-left; and the genes active at different positions along the body are shown in Figure 2A-right. (B) Scenario where Hox3 and Hox5 gene expressions do not conform to collinearity. Arrows show directions of transcription. Figure from Gaunt 2015 [8]. Lewis’s model describes Hox genes as expressing or non-expressing but we now understand that these states are more accurately described as, respectively, expressible or non-expressible. Recent studies in both mice [9,10] and Drosophila [11] show that the expressible Hox genes in a given cell (Fig. 2A-right) are in an open chromatin state, characterized by Trithorax (Trx) protein binding, while the non-expressible genes are in a closed chromatin state, characterized by Polycomb (Pc) protein binding (Figure 3). These chromatin states are usually heritable from one cell generation to the next, thereby ensuring that Hox expressibility patterns acquired in the early embryo are faithfully remembered at all later stages, enabling guidance throughout the course of development. At any region along the body there is typically only one boundary between expressible and non-expressible genes in both mouse [9,10] (Figure 2A-right, 3) and Drosophila [11]. In the subtle difference from Lewis’s original proposal, expressible genes need not always be expressing, as illustrated in Sections 2.4, 2.8. Expression depends upon availability of activating or repressive transcription factors and may change in a tissue over developmental time. Expressible Hox genes have been described as 'open for business' [12,13]. Regions of the body where Hox genes are non-expressible (closed for business; red zones in Figure 3) typically cannot express these genes at any time. Figure 3. Discreet domains of open and closed chromatin generally support Lewis’s model. Antibody studies show correspondence between the position of cells along the head-tail axis and the distributions of Hox genes between open (Hox expressible, green domain; Trx-rich) and closed (Hox non-expressible, red domain; Pc-rich) chromatin states. At each level along the body there is only a single boundary between these states, supporting Lewis’s model (Figure 2A-right). Re-drawn/modified from Noordermeer et al. [10]. In addition to spatial collinearity in Hox expression, many species including vertebrates [14], cephalochordates [15], some annelids [16] and some arthropods [17] display temporal collinearity. That is, the order of the Hox genes along the chromosome also corresponds with the time of their first expression in the embryo, such that anterior genes are expressed earlier than any more posterior genes. This review focuses on the role of Hox clusters in patterning the head-tail axis of bilaterians that develop in a regulative manner. Alternative, cell-lineage dependent (deterministic) development, such as seen, for example, in nematodes [18] and some ascidians [19] is not addressed. 2.1. Seeking sense in the evolutionary origin of Hox clustering and transcriptional direction Clustering arose due to gene duplication from an original proto-Hox gene. Duplication occurs due to unequal cross-over of DNA during meiosis in germ cells. This may be due to cross-over between repeat elements in the DNA, or between other regions of similar genetic sequence. It results in one chromosome becoming deleted in a genetic fragment – such as a Hox gene in our case – while its attached homologue incorporates the extra fragment. This process, called ‘tandem gene duplication’ leaves one chromosome with both the maternal- and paternal-derived copies of the Hox gene lying in tandem on the same DNA strand, and in the same orientation [20,21]. If this expanded chromosome contributes to a zygote then the additional Hox copy offers new scope for evolutionary change. For example, if it acquires a new anterior boundary of expression and function, due to a change in regulation or mutation, then this can specify a new zone of development along the head-tail axis. It is usually suggested that this re-purposing of the new Hox gene occurs after the duplication event [21]. However, it has been proposed that two identical Hox genes may provide a selective disadvantage, and that a more reasonable hypothesis might be that the genes were initially alleles of the same gene that already possessed some useful differences in expression and/or function [22]. Repeated cycles of tandem gene duplication then led to the growing Hox gene cluster, permitting ever increasing complexity of body structures along the head-tail axis. The process neatly explains not only why the Hox genes were formed in clusters, but also why they transcribe in the same direction. The cluster developed with spatial collinearity, and its polarity (that is, whether anterior-expressed gene lie upstream or downstream of posterior genes) was likely established with the initial duplication/re-purposing event [23]. Hox genes are members of the ANTP class of homeobox-containing gene. This also includes developmental gene groups paraHox, Dlx and NK, and all are thought to have arisen by tandem duplication from the same ANTP class proto-Hox gene [24]. Although paraHox, Dlx and NK genes are now usually dispersed from the Hox cluster some of these remain linked to the Hox cluster in at least some protostome and deuterostome species [24-27]. Apart from the ANTP class genes there are ten other classes of homeobox gene [24], such as PRD, POU and LIM, and all classes likely arose by tandem gene duplication from an original proto-homeobox gene. The question then arises as to when did these events take place. Figure 4 indicates the origin of homeobox genes within the tree of life. There are no ANTP class genes outside the metazoa [24]. However, duplication and diversification of ANTP class genes must have occurred very early in pre-bilaterian evolution because at least some poriferans (sponges) possess both NK [24] and paraHox genes [28] even though they lack Hox genes, which are presumed to have been secondarily lost [28]. Cnidarians possess Hox, paraHox and NK genes, but there is debate over whether cnidarian Hox genes are strict orthologues of bilaterian Hox genes, and whether they show spatial collinearity (Section 2.9). Proceeding backwards in time, greatest proliferation of homeobox genes occurred with the advent of multicellularity in animals (metazoa), and also in plants and fungi (Figure 4) [29]. Bacteria do not have homeobox genes. However, they do have helix-loop-helix genes which have similarities in structure and function, and which may have been the ancestral progenitors of homeobox genes [29]. Overall, the distribution of homeobox and Hox genes throughout the tree of life makes sense in terms of a progressive development over time of the Hox gene cluster. Figure 4. Origins of homeobox genes within the tree of life. Branching orders of Porifera relative to Cnidaria, and Xenacoelomorpha relative to protostomes and deuterostomes are uncertain. LUCA, last universal common ancestor; Ch-MLCA, choanoflagellate-metazoan last common ancestor; Cn-BLCA cnidarian-bilaterian last common ancestor; P-DLCA, protostome-deuterostome last common ancestor; MYA, million years ago. Figure from Gaunt 2019 [6]. 2.2. Seeking sense in the maintenance and compactness of clusters Clusters are not always maintained. Some species may develop splits in the cluster, e.g. Drosophila where splits occur at different positions in separate sub-species [30]. However, the large regulatory regions between insect Hox genes probably limits the number of places that clusters can be broken without affecting gene function [31]. Other species, such as the urochordate Oikiopleura, show complete dispersal of the Hox cluster throughout the genome though, remarkably, the genes continue to show spatial collinearity in expression relative to the position that they occupied in the ancestral cluster [19]. Most species, however, continue to maintain clustering of their Hox genes, at least to some extent. One explanation for this is enhancer sharing between Hox genes. For example, the iab-5 regulatory region in Drosophila apparently regulates both abd-A and Abd-B [32]. Similarly, in mouse the CR3 enhancer regulates both Hoxb4 and Hoxb3 to become expressed up to the level of rhombomere 6/7 [33], and a separate Kreisler element then also drives Hoxb3 expression in rhombomere 5 [34]. These are examples of local enhancer sharing, where the enhancer lies within the Hox gene cluster. In vertebrates there are also long-range enhancers, usually located outside the Hox gene cluster, which can loop-in to regulate multiple Hox genes in a tissue-specific way [35,36]. Long range enhancers, seen in vertebrates, encourage compaction and exclusion of other, non-Hox, genes. Compact clusters of vertebrates (Figure 5) are likely a derived rather than ancestral condition [36]. Figure 5. Hox clusters are more compact in vertebrates than in non-vertebrates. Clusters shown are all apparently intact. Sizes shown are from: Ixodes [37]; Parhyale [38]; Apis [39]; Anopheles [37]; Nasonia [37]; Tribolium [40]; Saccoglossus [41]; sea urchin [42]; amphioxus [36]; mouse [37] . Other proposed reasons to maintain clustering are provided by the chromatin opening model, the Hox conjecture, both discussed below, and by the supposition that transcriptional activation and repression may each work best on nearby genes, as discussed in Section 2.3. Species that have lost their clustering have presumably overcome the above requirements. Transcriptional direction within Hox clusters is usually maintained, but the Dfd gene in Drosophila provides an example of gene inversion from the ancestral state (Figure 1). Inversion is normally selected against because the promoter of the inverted gene may fall under the regulation of neighbour gene enhancers, with serious consequences for development. For example, the well-known dominant Drosophila Antennapedia homeotic mutation in which legs develop instead of antennae is usually due to inversions in the Antp gene, causing it to become ectopically expressed by the enhancers of upstream, more anteriorly-expressed genes [43,44]. Darbellay et al. [45] inverted mouse Hoxd11 and d12, including displacement of a CTCF insulator element that normally separates them from Hoxd13. This caused anterior shift in expression from the Hoxd13 locus, possibly due to its mis-regulation from a Hoxd11/d12, or long-range, enhancer. These authors also describe a Hoxd11 inversion that robustly suppressed expression from its Hoxd12 neighbour, and they propose that this may be due to collision of transcription units on opposing DNA strands. 2.3. Seeking sense in spatial and temporal collinearities Spatial collinearity describes correspondence between order of genes along the cluster (3′ to 5′) and order of their expression domains (anterior-to-posterior) along the developing embryo. Temporal collinearity describes a correspondence between order of genes along the cluster (3′ to 5′) and time of their first expression (early to late) in the embryo. Many embryo types, including vertebrates, grow by successive addition of new parts at their posterior ends. That is, they develop in a head-to-tail temporal sequence from a posterior growth zone, and each new zone moves its overall Hox expression one step down in the pattern shown in Figure 2A, right. The ‘chromatin opening’ model was developed principally for embryos that develop from a posterior growth zone. It proposes that genes are expressed by progressive, timed, 3′ to 5′ opening of Hox cluster chromatin structure that permits and regulates the expression of Hox genes in their anterior-to-posterior temporal sequence (temporal collinearity) [46]. This means that temporal collinearity specifies the need for, and is dependent upon, spatial collinearity. If this was so ancestrally, then we can likely predict that the ancestral bilaterian already had temporal collinearity [47] and developed its body axis from a posterior growth zone [48]. While it is clear that there is indeed progressive opening of the Hox cluster as gastrulation proceeds [9], it is not obvious whether this is a cause or a consequence of Hox gene activation. Some experimental evidence against the chromatin opening model has been reviewed earlier [8]. More recent work in mouse embryo stem cell aggregates [49] indicates that CTCF-binding insulator elements are successively breached along the Hoxd cluster, with accompanying, progressive chromatin extrusion and change in Hox gene expression. This process in itself suggests a need for some level of spatial collinearity and would be in-keeping with the chromatin opening model. However, the authors note that spatial and temporal collinearities persist even after deletion of these insulator elements, and they conclude that insulators regulate the pace and precision of collinearities rather than their organization [49,50]. Duboule’s Hox Conjecture [51] proposes that for reasons yet unclear, but not necessarily dependent upon chromatin opening, spatial collinearity may still be necessary to achieve temporal collinearity in Hox expression. Evidence for this is largely circumstantial, and is that species displaying temporal collinearity have so far been found to have an intact, unbroken Hox gene cluster. Studies upon the annelid Urechis unicinctus perhaps provide some evidence against this [52]. Its posterior Hox genes (Lox2 and Post2) are separated from more anterior genes (Hox1 to Lox4) by greater than 1Mb, as sub-clusters, yet the genes as a whole display spatial and temporal collinearities [52]. Conversely, however, an intact cluster need not necessarily imply temporal collinearity. For example, the hemichordate Saccoglossus [41] has an intact cluster, but apparently does not show temporal collinearity [53]. An alternative ‘gene segregation’ model [8] was proposed to account for why spatial collinearity is largely maintained even in species not showing either temporal collinearity or posterior growth zone, including perhaps the ancestral bilaterian. It could also be called the ‘single boundary’ model. In this model, spatial collinearity is the primary feature and temporal collinearity follows as a necessary consequence in animals that develop from a posterior growth zone. This is so in these animals because anterior parts form first due to earliest expression of anterior Hox genes, which are 3′-located. Posterior parts form last due to late expression of posterior Hox genes, which are 5′-located. Overall, spatial collinearity thereby gives rise to temporal collinearity. The model does not dispute the importance of timing in Hox gene activation, especially so in species that develop from a posterior growth zone, but it does suggest that this may not depend primarily upon spatial collinearity, as it does in the chromatin opening model. Comparison of Figures 2A and 2B shows how spatial collinearity (Figure 2A) provides the minimum number (one) of boundaries between expressible and non-expressible Hox genes within the cluster at all positions along the body (Figure 3). The single boundary model proposes that the bilaterian cluster was established and maintained with spatial collinearity because this arrangement allows a number of potential benefits: see [8] and references therein, and now briefly summarized as follows. (i) Minimal boundaries means minimum risk that gene repressive factors, e.g. Pc protein, will inadvertently spread from inactive to active genes; and also minimum risk that gene activating factors, e.g. Trx protein, will spread from active to inactive genes. (ii) All active genes can more readily be grouped together in a nuclear transcription factory where there may be higher local concentrations of transcription and other factors necessary for gene expression. And all non-expressible genes can be grouped together in a nuclear Pc body where there may be higher local concentrations of repressive factors. (iii) Segregated active and inactive genes are less likely to interfere sterically with each other in their movement to transcription factories and Pc bodies. (iv) Contiguity of repressed Hox genes allows potential cross-spreading between them of repressive components. Hox gene collinearity, well known to be ubiquitous in bilaterians, therefore remains an enigmatic feature of Hox gene expression. The ancestral bilaterian cluster was likely to have had an incompact Hox cluster, with vertebrate clusters having since become compacted [36]. Vertebrate Hox genes, perhaps becoming mutually more entangled in their regulation, might therefore not provide the ideal system to make sense of the ancestral state. There remains a need to investigate collinearity in a wide range of species. 2.4. Seeking sense in the partially overlapping nature of Hox gene expression patterns There are three obvious features of Hox gene activity (Figures 2A, 6A, 7). First, each gene is usually expressed over a long A-P domain, and for segmented animals this means that it is usually expressed across multiple segments. Second, within these long domains there is usually a gradation of Hox expression intensity, often forming an A-P gradient. Third, the long expression domains for different Hox genes are usually partially overlapping along the head-tail axis. These features of Hox gene expression can greatly increase the number of possible positional addresses along the body. For example, a cluster of 13 Hox genes expressed without graded activity or overlap could provide only 14 positional addresses (if we assume that the most anterior address is absence of Hox gene expression). But if cells are able to monitor their intracellular concentrations of two or more Hox expressions then many more than 14 positional addresses become possible. The long domains of Hox gene expression allow structural features to be reiterated along the body. For example, forward walking legs in the crustacean Parhyale occur on adjacent segments that express Ubx but not abd-A (Figure 6A). Regions along the body of arthropods that show similar anatomical features are called tagmata. A change from one tagma to the next is typically accompanied, and caused (Sections 2.7, 2.8), by a change in the complement of Hox gene activity. Figure 6. Parhyale Hox gene expression patterns in appendages and their knockout phenotypes. (A) normal Hox expressions and appendage phenotypes [17], (B-E) Hox gene knockout (KO) appendages: Antp (B); Ubx (C); abd-A (D); Abd-B (E). Arrows indicate directionality of the homeotic changes. Reproduced/modified from Martin et al. [54], with permission of Elsevier. Similarly, along the mouse vertebral column a sudden change from one vertebral type to another is usually marked, and caused (Section 2.7), by the anterior expression boundary of a new Hox gene. For example, Hox6, Hox10 and Hox11 expression boundaries mark, respectively, the transitions from cervical to thoracic, thoracic to lumbar and lumbar to sacral vertebrae (Figure 7). Figure 7. Mouse Hoxa and/or Hoxd expression domains within the pre-vertebral column of mid-gestation embryos. Hox shading intensities indicate orthologue groups, as in Figure 1. Expression patterns are as described in the following references: Hoxa4-a10 [55]; Hoxa2 [56]; Hoxa3 [57]; Hoxa10 [58]; Hoxd10-d12 [14]. With an overlapping pattern of Hox gene expressions the question then arises as to whether two or more Hox genes expressed in a region are co-operative (combinatorial) in their actions, or whether one is dominant (hierarchical) in its effect over the other(s). Some examples are known of combinatorial Hox gene activity. In Parhyale, for example, abd-A and Abd-B are expressed in partially overlapping domains along the abdominal segments (Figure. 6A). Segments expressing abd-A in absence of Abd-B develop reverse walking legs; those expressing both abd-A and Abd-B develop swimmerets (also called pleopods); and those expressing only Abd-B develop uropods. Swimmerets apparently require combinatorial gene activity since animals mutant for either abd-A or Abd-B do not develop swimmerets (Figure 6D,E; Section 2.7). Although co-operating Hox genes are seen to be expressed in the same tissue it is not always clear whether or not they are also expressed in the same cells. More common are examples of how a Hox gene is dominant in its developmental effect over any more anteriorly expressed Hox genes, a mechanism known as ‘posterior prevalence’ [59,60]. Dominance may act at the level of suppressing a more anterior Hox gene’s expression, or only its function. The mechanisms are often unclear but at least some cases involve microRNAs or long non-coding RNAs [61]. In the Pc mutant of Drosophila, Abd-B expressed anterior to its normal domain is able to prevail in its function over those of all more anterior Hox genes, converting all segments to the morphology of the eighth abdominal segment (A8) [62]. Drosophila Ubx is normally expressed in parasegments 5 to 13 and notably acts to specify parasegment 6, visualized in A1. Over- and ectopic-expression of Ubx throughout the Drosophila embryo [63] results in the transformation of anterior segments towards an A1 morphology since Ubx function dominates over those of more anterior Hox genes. However, overexpression of Ubx in segments posterior to A1 does not affect their morphology since abd-A and Abd-B are dominant to Ubx. Many similar examples are seen in mouse. For example, Hox10 genes suppress the rib forming activity of more anteriorly expressed Hox genes in order to specify the rib-free lumbar region [64]. With an overlapping pattern of Hox gene expressions (Figure 2A) dominance of posterior genes makes sense: it allows for a sharp A-P boundary between one set of anatomical structures and the next. Crucially, it requires that posterior, dominant gene expression is excluded from anterior parts of the embryo, and this is the mechanism sustained by Pc repression (Figure 3). 2.5. Seeking sense in the head-tail polarity of Hox expression domains A partially overlapping pattern of Hox gene expression could be formed with spatial collinearity in the pattern shown in Figure 2A, but also as the alternative forms of Figures 8A-C. In scenario 8A the Hox genes lie in the opposite 3′ to 5′ relative positions from that found in nature (Figure 2A). While this may appear to be acceptable for embryo development it is also possible that it is less so due to read-through into more posterior, and dominant, Hox genes. Transcriptional read-through in the Figure 2A cluster can result only in sense transcripts for more anterior, and likely therefore repressed, Hox genes. There are several reports of transcriptional read-through between adjacent Hox genes: in crustaceans [65], in human HOXC6, C5 and C4, where the same 5′ non-coding exon is spliced on both HOXC6 and HOXC4 mature placental mRNAs [66], and in several other mammalian Hox genes where the function is unknown [67]. Figure 8. (A) to (C) depict possible head to tail Hox gene expression polarities that are alternatives to that of Figure 2A, but where each displays its own form of spatial collinearity. Arrows show directions of transcription. Scenarios 8B and 8C could work only if anterior (Hox1-end) genes had evolved dominant to posterior genes. In animals that develop in a head-to-tail temporal sequence there would be need for sequential suppression of Hox genes and their proteins. Sequential removal of Hox proteins from cells may be less easily controlled than their sequential activation, as is the case of Figure 2A. The fact that the cluster did not arise in the form of Figures 8B or 8C may be support for the suggestion that the ancestral animal did indeed develop in a head-to-tail temporal sequence [48]. Scenario 8C carries the possible additional problem of read-through into more dominant (this time, anterior) Hox genes. These arguments suggest that there is sense in the fact that the Hox cluster developed with the head-tail polarity of Figure 2A. However, we know that the Hox cluster evolved successfully only once, and it may have been by chance alone that this was in the way shown in Figure 2A. 2.6. Seeking sense in the duplication of Hox genes and clusters An increase in the number of genes within the cluster potentially provides more positional cues along the body. For example, one ancestral Hox gene duplicated to give Antp, Ubx and abd-A in Drosophila and, independently, Hox gene groups 6 to 8 in vertebrates (Figure 1). These genes have now acquired different roles and expressions (Sections 2.7, 2.8). Similarly, the ancestral Abd-B gene duplicated to give Hox gene groups 9 to 13 in vertebrates and some other deuterostomes (Figure 1). Most protostomes, however, acquired diversity without substantial change in Hox gene numbers [5,6]. More profound increase in Hox genes has been achieved in vertebrates by whole cluster duplications. Two rounds (2R) of whole genome duplication, likely occurring by chance in early vertebrates, gave rise to the four clusters seen today in mice and humans [68]. Analysis of the Hox genes present or missing in the four clusters indicates that the A and B and the C and D clusters are most likely to be sister clusters. That is, the 1R event resulted in two clusters (A,B) and (C,D). The 2R event then resulted in four clusters A, B, C and D [68]. Modern bony fish often have more than 4 clusters due to ancestral 3R and 4R events [69]. Several reasons can be suggested as to why cluster duplication was advantageous in vertebrate evolution. First, it permitted increased complexity along the head-tail axis. Thus, while expression studies [70] and functional analyses (Section 2.7) have revealed some cases where paralogous genes from different mouse clusters apparently share the same or similar anterior boundaries of expression and function (e.g., Hoxa4, b4, d4), in other cases paralogues have clearly adopted different expression boundaries (e.g., Hoxc4), suggesting that duplication could have provided opportunity for increased positional information along the head-tail axis [57]. Second, cluster duplication allowed individual clusters to acquire separate roles. This is suggested by observations that expression in developing spinal cord tracts and in internal organs is often similar for genes within a cluster but different for genes between clusters. As examples, Hoxb genes alone are confined in expression to the dorsal nerve tracts of the mouse embryo spinal cord [57]; Hoxc genes (c4 and c5) are expressed abundantly in the embryonic mouse oesophagus but not the trachea, while Hoxa genes (a3, a4, a5) are expressed in trachea but not oesophagus [71,72]. Similarly, Hoxc (c5, c6) and Hoxd (d4, d8, d9) genes are expressed in the embryonic mouse testis [73,74], while Hoxa and Hoxb genes are not. Third, cluster duplication may have been required in other ways for anatomical change in part of the vertebrate body. For example, heart tube folding and Hox cluster duplication are features present in vertebrates but not in non-vertebrate chordates. Heart folding required duplication to Hoxa and Hoxb clusters, and deletion of either from mice results in a heart tube that does not fold: an apparent atavistic change [75]. It is not certain whether heart folding is due to a gene dosage effect or to an acquired, cooperative, regulatory difference between the Hoxa and Hoxb clusters. The question arises as to why cluster duplications have provided selective advantage in vertebrates but not, typically, in non-vertebrate deuterostomes or most protostomes. Vertebrates are believed to be more constrained than other animals in the numbers and spacing of the Hox genes and regulatory elements within their clusters (Figure 5). This is likely due to their acquired dependence on long range enhancer elements which co-regulate multiple contiguous genes within each cluster (Section 2.2). Such regulations are correlated with vertebrate-specific traits [36,76,77]. In vertebrates, therefore, it is likely that numbers of Hox genes could increase further only by whole cluster duplications [78]. Just as evolving increased axial complexity has demanded rising numbers of Hox genes, so evolving increased axial simplicity may be accompanied by loss of Hox genes. For example, tardigades are protostomes, and ecdysozoans, that possess only four pairs of legs due to the apparent loss from the ancestral condition of more posterior leg-bearing segments. The Hox genes that specified the lost segments (Antp, Ubx and abd-A) are now absent from the tardigrade genome [79]. A similar example might be the acoel flatworms which are bilaterians possessing only three Hox genes that represent the anterior, middle and posterior groups of other bilaterians [80,81]. Acoels are sometimes regarded as degenerate deuterostomes [82,83]. A more intriguing view, however, is that acoels are primitive bilaterians which form part of a sister group to all other bilaterians [84]. Consistent with this, all members of this group lack features that are common to most other bilaterians such as an anus, kidney tubules and a circulatory system. The acoels might therefore be providing us with a glimpse into the early evolution of the bilaterian Hox gene cluster. 2.7. Seeking sense in the phenotype of Hox gene knockouts Reference to Figure 2A shows how partially overlapping domains of Hox expression along the body dictate that loss (e.g., by knockout) of a Hox gene results in its functional domain expressing the Hox profile normally expressed by the more anterior segment(s). In consequence, this region commonly duplicates the anatomy of a more anterior segment (producing a so-called anterior homeotic transformation). This is a prediction intrinsic in Lewis’s model. For example, the bxd mutation in Drosophila disrupts a normal pattern of Ubx expression in parasegment 6 [85,86]. In keeping with this, the bxd mutant has a pair of legs on the first abdominal segment, now specified by the Antp gene. As described below these same principles are seen in the results of Hox gene knockout studies in another arthropod, Parhyale, and in vertebrates. In Parhyale (Figure 6A), a crustacean, the T1 segment bears a pair of maxillipeds (feeding appendages, like the maxillae on head segments); T2-T3 bear prehensile claw-like chelipeds (legs bearing claws); T4-T5 bear forward walking legs; T6-T8 bear reverse walking legs; A1-A3 bear swimmerets; and A4-A6 bear uropods. Each of these morphologies corresponds with its own unique Hox expression profile as follows. 1) The maxillipeds of T1 express Antp but not Ubx. 2) The chelipeds of T2 and T3 express high levels of Antp but only low levels of Ubx. 3) The forward walking legs of T4 to T5 express high levels of Ubx, while the reverse walking legs on T6-T8 additionally express abd-A. 4) The swimmerets on A1-A3 express both abd-A and Abd-B. 5) The uropods on A4-A6 express only Abd-B. Knockout of Hox genes in Parhyale [54] generally supports the predictions of Lewis's model (Figure 2A). Knockout of Ubx results in partial transition of cheliped appendages on T2 and T3 to maxilliped morphology (Figure 6C), presumably under the influence of Antp. Knockout of abd-A results in only one leg type, forward walking leg, presumably formed under the influence of Ubx (Figure 6D). Knockout of Abd-B transforms swimmerets into reverse walking legs, presumably under influence of abd-A; and uropods into forward walking legs, perhaps due to loss of a normal Abd-B posterior prevalence over Ubx in the posterior segments (Figure 6E). In apparent contradiction to Lewis's model, knockout of Antp in Parhyale transforms T2 and T3 chelipeds into forward walking legs, a posterior homeotic transformation (Figure 6B). This could be explained if the low levels of Ubx expression in T2 and T3 are normally overridden by high levels of Antp (Figure 6A). Knockout of Hox genes in mouse generally supports the predictions of Lewis's model (Figure 2A). Hoxa4, b4 and d4 paralogues all share an anterior boundary of expression at the second cervical vertebra, C2 or axis (Figure 7) [70]. In the absence of Hoxa4, b4 and d4 (that is, absence of all six genes, both maternal and paternal copies) the first five cervical vertebrae assume a complete or partial C1 identity (Figure 9), presumably under the influence of Hox3 genes. The extent of this transformation varies with the dose of normal Hox4 genes [87]. Thus, in single mutants for Hoxb4 or Hoxd4 there is sometimes a partial transformation of C2 to C1 morphology. In Hoxb4/Hoxd4 double mutants C2 is almost completely transformed to a C1 identity, but if double mutants possess only a single copy of a normal Hoxb4 or Hoxd4 gene then the C2 to C1 transformation is less complete. Figure 9. (A) Anterior skeletal pattern of newborn mouse. (B) Skeletal pattern of mouse knocked out (KO) for six Hox4 genes (Hoxa4/b4/d4, maternal and paternal). The Hox9 group of mouse Hox genes, expressed as in Figure 7, has its main zone of influence in the posterior thoracic region. In the normal embryo the first 7 pairs of ribs attach to the sternum ventrally and the next 6 pairs attach either to the 7th pair or are unattached ventrally (Figure 10A). It appears that Hox9 genes are required for formation of non-sternal ribs. Thus, in absence of all eight Hox9 genes (that is, both maternal and paternal copies) the posterior thoracic vertebrae fall under the influence of the more anterior Hox genes resulting in about 13 pairs of ribs attached to the sternum (Figure 10B) [88]. In addition, thoracic-type vertebrae with unattached ribs replace the anterior lumbar vertebrae. As found for Hox4 genes the most severe phenotypes require extensive loss of normal genes within the Hox group. Leaving just one normal Hox9 gene in a mutant that is otherwise mutant for seven Hox9 genes results in a less severe phenotype. Figure 10. (A) Skeletal pattern of newborn mouse. (B) Mouse knocked out (KO) for all eight Hox9 genes (a9, b9, c9, d9, maternal and paternal). (C) Mouse KO for all six Hox10 genes (a10, c10, d10, maternal and paternal). (D) Mouse KO for all six Hox11 genes (a11, c11, d11, maternal and paternal). The Hox10 group of mouse Hox genes, expressed as in Figure 7, has its main influence in the lumbar region. This apparently requires suppression of more anterior Hox genes that fashion the thoracic (ribbed) phenotype. In absence of all six Hox10 genes (Hoxa10, c10, d10) [89], the lumbar and sacral regions fall under the influence of more anterior Hox genes resulting in thoracic type vertebrae with ribs instead of normal lumbar and sacral vertebrae (Figure 10C). The presence of only one functional Hox10 gene results in a much less severe phenotype, displaying only one extra pair of ribs. The Hox11 group of genes, expressed as in Figure 7, has its main influence in the sacral region. This apparently requires suppression of more anterior genes that fashion the lumbar phenotype. In absence of all six Hox11 genes (Hoxa11, c11, d11) [89], the lumbar and sacral regions fall under the influence of more anterior Hox genes resulting in lumbar type vertebrae instead of normal sacral vertebrae. The lumbar type vertebrae also replace the most anterior caudal vertebrae (Figure 10D). The presence of only one functional Hox11 gene results in a less severe phenotype. 2.8. Seeking sense in the evolution of axial morphologies by shifts in Hox expression We have already mentioned two ways that may lead on to an evolved change in axial patterning. First, generation of new Hox genes by duplication of either individual genes or entire clusters. These new genes may then acquire new expressions and functions (Sections 2.1, 2.7). Second, acquisition in vertebrates of long range enhancers located outside the ancestral Hox cluster (Section 2.2). These allow use of the Hox cluster, in addition to its role in head-tail patterning, to specify development along evolutionarily novel structures such as limbs and external genitalia [35]. However, the main objective in what follows is to examine the commonly utilized mechanism whereby axial morphologies may change by shifts in the expression domains of existing Hox genes, without prior need to generate new ones. The mechanism is especially clear in arthropods where axial patterning is highly variable, and is seen readily in the range of appendage types, even though the overall complement of Hox genes remains rather stable [37]. Figure 11 shows the expression domains of a variety of Hox genes in arthropods Drosophila, Artemia and Parhyale. In insects, Abd-B is expressed in the posterior-most segments of the abdomen [90] and abd-A is not expressed in the thorax [91]. In crustaceans, Abd-B is expressed throughout the abdomen and abd-A expression extends into the thorax [17,91,92]. It has been suggested that the pattern of Ubx, abd-A and Abd-B expressions in ancestral arthropods may have been similar to those found today in Artemia [91] (Figure 11). There is also variation between arthropod types in which of the Hox genes Ubx, abd-A and Abd-B are able to suppress limb development [92]. This may have occurred in a step-wise sequence over time, and it takes place most notably by their suppression of the Distalless (Dll) gene [93]. In Parhyale (Malacostra) none of these genes repress limb development. In Artemia (Brachiopoda) it has been suggested that only Abd-B represses limb development [92]. In Drosophila (Diptera) Ubx, abd-A and Abd-B all suppress limb development. Figure 11. Hox expressions in arthropods. Figure drawn mainly from Abzhanov and Kaufman [94], Averof and Akam [91], Serano et al.[17] and McCarthay-Taylor et al. [92]. T, thorax; A, abdomen. Figure from Gaunt 2019 [6]. How may the ancestral arthropod have evolved the diverse head-tail morphologies seen among arthropods today, bearing in mind the evolved changes in the roles of Ubx, abd-A and Abd-B as suppressors of Dll and limb development? Some likely possibilities are seen in Figures 6 and 11. First, in the ancestry of Drosophila a backward shift of Ubx may have facilitated formation of the three leg-bearing thoracic segments under the control of Antp (Figure 11). Second, a backward shift of Ubx in a Parhyale ancestor may have permitted activity of Antp in the anterior thorax, facilitating here the evolution of maxillipeds. On this topic, different crustacean species vary in the number of anterior thoracic segments where walking legs have become evolutionarily transformed into maxillipeds. An examination of 13 different crustacean species, where maxilliped numbers vary from 0 to 3 pairs, showed that changes in their anterior expression patterns of Ubx and abd-A correlate well with the axial level of the transition from walking leg to maxilliped [95]. Third, in the ancestry of Parhyale a backward shift in the anterior boundary of abd-A expression may have permitted development of forward walking legs, under control of the Ubx gene (Figures 6, 11). The Malacostra include the group Amphipoda (‘different foot’) which has diverse walking leg types (e.g., Parhyale), but also the group Isopoda (‘equal foot’) which typically has only one type of thoracic leg. In isopods, but not amphipods (Figure 11), abd-A extends over most of the thorax [94]. Vertebrate shifts in Hox expression, with corresponding morphological changes, have been shown most clearly in comparisons of mouse, chicken and goose. The position of the cervical/thoracic vertebral junction, and hence the length of the neck, has changed by shifts in expressions of Hox6 genes, known to regulate thoracic vertebra and rib development [96-98]. In an additional strategy, the rib cage of snakes extends posteriorly because Hox10 genes have lost their ability to block the thoracic phenotype. This is due to mutation in the Myf5 gene enhancer, which is normally a downstream target of Hox10 [99]. Most of the conclusions reached above regarding effects of evolutionary shifts in Hox expression are supported by the results of Hox gene knockout studies (Section 2.7). 2.9. Seeking sense in the Hox clustering of pre-bilaterians This article so far describes Hox gene clusters of bilaterally symmetrical animals (the Bilateria). Amongst pre-bilaterians, only Cnidaria possess Hox genes. These may include Hox genes orthologous with those of Bilateria: a group 1, a group 2, and a middle or posterior group Hox gene [100,101], suggesting that the bilaterian Hox cluster had already begun to form in the cnidarian/bilaterian ancestor, perhaps even to pattern its head-tail axis [102]. However, some authors question such close homology [103]. In the coral Acropora, all three Hox genes are linked in a single cluster, but in some other cnidarian species there is either no cluster or only clustering of the anterior genes [101]. The number of Hox genes often varies between different cnidarian species, and the particular genes present may also vary. Hox expression analyses on several cnidarians have shown no consistent evidence for collinearity, no consistency in expression patterns of orthologous Hox genes in different species, and therefore no evidence for a conserved cnidarian Hox code [100,103,104]. We can assume that our distant P-DLC bilaterian ancestor arose on an evolutionary pathway that had provided it with the core Hox gene cluster, collinear Hox expressions and, probably, a role for these expressions to specify pattern formation along its head-tail axis. It seems likely that alongside this ancestor other creatures were also experimenting with Hox genes, using them to specify body parts, and evolving ever more complex clusters. It is also likely that many of these were evolving Hox clusters with collinearity and so this alone seems unlikely to be the reason why one animal came to dominate and take over the world. Some other evolutionary event, perhaps acquisition of a three-layered body wall, or bilateral symmetry, may instead have provided the selective advantage to our ancestor which allowed it to prevail. The cnidarians with their two-layered body wall and radial symmetry might therefore represent animals ‘stuck’ at an early experimental stage of Hox cluster evolution. 3. Conclusions It is suggested here that there is much sense in the way that the Hox cluster of bilaterians arose in clusters by gene duplication, expressed its genes in overlapping patterns with a specific head-tail polarity, maintained its clustered Hox genes with the same directions of transcription, shifted its expressions for change in head-tail patterning, generates anterior homeotic shifts in Hox gene knockout studies, and was compacted and duplicated during vertebrate evolution. Hox gene collinearity, well known to be ubiquitous in bilaterians, remains an enigmatic feature of Hox gene expression. Two different models that attempt to make sense of it are discussed. References 1. Boncinelli, E.; Somma, R.; Acampora, D.; Pannese, M.; D'Esposito, M.; Faiella, A.; Simeone, A. Organization of human homeobox genes. Hum Reprod 1988, 3, 880-886. 2. Duboule, D.; Dolle, P. The structural and functional organization of the murine hox gene family resembles that of drosophila homeotic genes. The EMBO journal 1989, 8, 1497-1505. 3. Graham, A.; Papalopulu, N.; Krumlauf, R. The murine and drosophila homeobox gene complexes have common features of organization and expression. Cell 1989, 57, 367-378. 4. Balavoine, G.; de Rosa, R.; Adoutte, A. Hox clusters and bilaterian phylogeny. Molecular phylogenetics and evolution 2002, 24, 366-373. 5. Gaunt, S.J. Hox cluster genes and collinearities throughout the tree of animal life. The International journal of developmental biology 2018, 62, 673-683. 6. Gaunt, S.J. Made in the image of a fly. Amazon books and ebooks: 2019. 7. Lewis, E.B. A gene complex controlling segmentation in drosophila. Nature 1978, 276, 565-570. 8. Gaunt, S.J. The significance of hox gene collinearity. The International journal of developmental biology 2015, 59, 159-170. 9. Soshnikova, N.; Duboule, D. Epigenetic temporal control of mouse hox genes in vivo. Science 2009, 324, 1320-1323. 10. Noordermeer, D.; Leleu, M.; Splinter, E.; Rougemont, J.; De Laat, W.; Duboule, D. The dynamic architecture of hox gene clusters. Science 2011, 334, 222-225. 11. Bowman, S.K.; Deaton, A.M.; Domingues, H.; Wang, P.I.; Sadreyev, R.I.; Kingston, R.E.; Bender, W. H3k27 modifications define segmental regulatory domains in the drosophila bithorax complex. eLife 2014, 3, e02833. 12. Akam, M.; Dawson, I.; Tear, G. Homeotic genes and the control of segment diversity. Development Supplement 1988, 104, 123-133. 13. Maeda, R.K.; Karch, F. The open for business model of the bithorax complex in drosophila. Chromosoma 2015, 124, 293-307. 14. Izpisua-Belmonte, J.C.; Falkenstein, H.; Dolle, P.; Renucci, A.; Duboule, D. Murine genes related to the drosophila abdb homeotic genes are sequentially expressed during development of the posterior part of the body. The EMBO journal 1991, 10, 2279-2289. 15. Pascual-Anaya, J.; Adachi, N.; Alvarez, S.; Kuratani, S.; D'Aniello, S.; Garcia-Fernandez, J. Broken colinearity of the amphioxus hox cluster. EvoDevo 2012, 3, 28. 16. Frobius, A.C.; Matus, D.Q.; Seaver, E.C. Genomic organization and expression demonstrate spatial and temporal hox gene colinearity in the lophotrochozoan capitella sp. I. PloS one 2008, 3, e4004. 17. Serano, J.M.; Martin, A.; Liubicich, D.M.; Jarvis, E.; Bruce, H.S.; La, K.; Browne, W.E.; Grimwood, J.; Patel, N.H. Comprehensive analysis of hox gene expression in the amphipod crustacean parhyale hawaiensis. Developmental biology 2016, 409, 297-309. 18. Aboobaker, A.; Blaxter, M. Hox gene evolution in nematodes: Novelty conserved. Current opinion in genetics & development 2003, 13, 593-598. 19. Seo, H.C.; Edvardsen, R.B.; Maeland, A.D.; Bjordal, M.; Jensen, M.F.; Hansen, A.; Flaat, M.; Weissenbach, J.; Lehrach, H.; Wincker, P., et al. Hox cluster disintegration with persistent anteroposterior order of expression in oikopleura dioica. Nature 2004, 431, 67-71. 20. Gehring, W.J.; Kloter, U.; Suga, H. Evolution of the hox gene complex from an evolutionary ground state. Current topics in developmental biology 2009, 88, 35-61. 21. Lewis, E.B. The bithorax complex: The first fifty years. The International journal of developmental biology 1998, 42, 403-415. 22. Tarailo-Graovac, M.; Chen, N. Gene clustering in eukaryotes. In: eLS. John Wiley & Sons Ltd: Chichester. 2013. 23. Gaunt, S.J.; Gaunt, A.L. Possible rules for the ancestral origin of hox gene collinearity. Journal of theoretical biology 2016, 410, 1-8. 24. Holland, P.W.H. Evolution of homeobox genes. Wires Dev Biol 2013, 2, 31-45. 25. Hui, J.H.L.; McDougall, C.; Monteiro, A.S.; Holland, P.W.H.; Arendt, D.; Balavoine, G.; Ferrier, D.E.K. Extensive chordate and annelid macrosynteny reveals ancestral homeobox gene organization. Molecular biology and evolution 2012, 29, 157-165. 26. Pollard, S.L.; Holland, P.W.H. Evidence for 14 homeobox gene clusters in human genome ancestry. Current Biology 2000, 10, 1059-1062. 27. Garcia-Fernandez, J. The genesis and evolution of homeobox gene clusters. Nat Rev Genet 2005, 6, 881-892. 28. Ferrier, D.E. The origin of the hox/parahox genes, the ghost locus hypothesis and the complexity of the first animal. Briefings in functional genomics 2016, 15, 333-341. 29. Derelle, R.; Lopez, P.; Le Guyader, H.; Manuel, M. Homeodomain proteins belong to the ancestral molecular toolkit of eukaryotes. Evol Dev 2007, 9, 212-219. 30. Negre, B.; Ruiz, A. Hom-c evolution in drosophila: Is there a need for hox gene clustering? Trends in genetics : TIG 2007, 23, 55-59. 31. Shippy, T.D.; Hosmani, P.S.; Florez-Gonzalez, M.; Mueller, L.A.; Hunter, W.B.; Brown, S.J.; D'Elia, T.; Saha, S. Annotation of hox cluster and hox cofactor genes in the asian citrus psyllid, diaphorina citri, reveals novel features. Gigabyte 2022, 1-18. 32. Celniker, S.E.; Sharma, S.; Keelan, D.J.; Lewis, E.B. The molecular genetics of the bithorax complex of drosophila: Cis-regulation in the abdominal-b domain. The EMBO journal 1990, 9, 4277-4286. 33. Gould, A.; Morrison, A.; Sproat, G.; White, R.A.; Krumlauf, R. Positive cross-regulation and enhancer sharing: Two mechanisms for specifying overlapping hox expression patterns. Genes & development 1997, 11, 900-913. 34. Kwan, C.T.; Tsang, S.L.; Krumlauf, R.; Sham, M.H. Regulatory analysis of the mouse hoxb3 gene: Multiple elements work in concert to direct temporal and spatial patterns of expression. Developmental biology 2001, 232, 176-190. 35. Montavon, T.; Duboule, D. Chromatin organization and global regulation of hox gene clusters. Philosophical transactions of the Royal Society of London. Series B, Biological sciences 2013, 368, 20120367. 36. Duboule, D. The rise and fall of hox gene clusters. Development 2007, 134, 2549-2560. 37. Pace, R.M.; Grbic, M.; Nagy, L.M. Composition and genomic organization of arthropod hox clusters. EvoDevo 2016, 7, 11. 38. Sun, D.A.; Patel, N.H. The amphipod crustacean parhyale hawaiensis: An emerging comparative model of arthropod development, evolution, and regeneration. Wiley interdisciplinary reviews. Developmental biology 2019, 8, e355. 39. Dearden, P.K.; Wilson, M.J.; Sablan, L.; Osborne, P.W.; Havler, M.; McNaughton, E.; Kimura, K.; Milshina, N.V.; Hasselmann, M.; Gempe, T., et al. Patterns of conservation and change in honey bee developmental genes. Genome research 2006, 16, 1376-1384. 40. Shippy, T.D.; Ronshaugen, M.; Cande, J.; He, J.; Beeman, R.W.; Levine, M.; Brown, S.J.; Denell, R.E. Analysis of the tribolium homeotic complex: Insights into mechanisms constraining insect hox clusters. Development genes and evolution 2008, 218, 127-139. 41. Freeman, R.; Ikuta, T.; Wu, M.; Koyanagi, R.; Kawashima, T.; Tagawa, K.; Humphreys, T.; Fang, G.C.; Fujiyama, A.; Saiga, H., et al. Identical genomic organization of two hemichordate hox clusters. Current biology : CB 2012, 22, 2053-2058. 42. Arenas-Mena, C.; Cameron, A.R.; Davidson, E.H. Spatial expression of hox cluster genes in the ontogeny of a sea urchin. Development 2000, 127, 4631-4643. 43. Jorgensen, E.M.; Garber, R.L. Function and misfunction of the two promoters of the drosophila antennapedia gene. Genes & development 1987, 1, 544-555. 44. Schneuwly, S.; Kuroiwa, A.; Gehring, W.J. Molecular analysis of the dominant homeotic antennapedia phenotype. The EMBO journal 1987, 6, 201-206. 45. Darbellay, F.; Bochaton, C.; Lopez-Delisle, L.; Mascrez, B.; Tschopp, P.; Delpretti, S.; Zakany, J.; Duboule, D. The constrained architecture of mammalian hox gene clusters. Proceedings of the National Academy of Sciences of the United States of America 2019, 116, 13424-13433. 46. Duboule, D. Temporal colinearity and the phylotypic progression: A basis for the stability of a vertebrate bauplan and the evolution of morphologies through heterochrony. Development Supplement 1994, 135-142. 47. Ferrier, D.E.; Holland, P.W. Ciona intestinalis parahox genes: Evolution of hox/parahox cluster integrity, developmental mode, and temporal colinearity. Molecular phylogenetics and evolution 2002, 24, 412-417. 48. Gold, D.A.; Runnegar, B.; Gehling, J.G.; Jacobs, D.K. Ancestral state reconstruction of ontogeny supports a bilaterian affinity for dickinsonia. Evol Dev 2015, 17, 315-324. 49. Rekaik, H.; Lopez-Delisle, L.; Hintermann, A.; Mascrez, B.; Bochaton, C.; Duboule, D. Sequential and directional insulation by conserved ctcf sites underlies the hox timer in pseudo-embryos. bioRkiv 2022. 50. Amandio, A.R.; Beccari, L.; Lopez-Delisle, L.; Mascrez, B.; Zakany, J.; Gitto, S.; Duboule, D. Sequential in cis mutagenesis in vivo reveals various functions for ctcf sites at the mouse hoxd cluster. Genes & development 2021, 35, 1490-1509. 51. Duboule, D. The (unusual) heuristic value of hox gene clusters; a matter of time? Developmental biology 2022, 484, 75-87. 52. Wei, M.; Qin, Z.; Kong, D.; Liu, D.; Zheng, Q.S.; Bai, S.; Zhang, Z.; Ma, Y. Echiuran hox genes provide new insights into the correspondence between hox subcluster organization and collinearity pattern. Proc. Roy. Soc. B. 289 20220705. 2022. 53. Aronowicz, J.; Lowe, C.J. Hox gene expression in the hemichordate saccoglossus kowalevskii and the evolution of deuterostome nervous systems. Integrative and comparative biology 2006, 46, 890-901. 54. Martin, A.; Serano, J.M.; Jarvis, E.; Bruce, H.S.; Wang, J.; Ray, S.; Barker, C.A.; O'Connell, L.C.; Patel, N.H. Crispr/cas9 mutagenesis reveals versatile roles of hox genes in crustacean limb specification and evolution. Current biology : CB 2016, 26, 14-26. 55. Hautier, L.; Charles, C.; Asher, R.J.; Gaunt, S.J. Ossification sequence and genetic patterning in the mouse axial skeleton. Journal of experimental zoology. Part B, Molecular and developmental evolution 2014, 322, 631-642. 56. Rijli, F.M.; Mark, M.; Lakkaraju, S.; Dierich, A.; Dolle, P.; Chambon, P. A homeotic transformation is generated in the rostral branchial region of the head by disruption of hoxa-2, which acts as a selector gene. Cell 1993, 75, 1333-1349. 57. Gaunt, S.J. Expression patterns of mouse hox genes: Clues to an understanding of developmental and evolutionary strategies. BioEssays : news and reviews in molecular, cellular and developmental biology 1991, 13, 505-513. 58. Gaunt, S.J.; Dean, W.; Sang, H.; Burton, R.D. Evidence that hoxa expression domains are evolutionarily transposed in spinal ganglia, and are established by forward spreading in paraxial mesoderm. Mechanisms of development 1999, 82, 109-118. 59. Duboule, D. Patterning in the vertebrate limb. Current opinion in genetics & development 1991, 1, 211-216. 60. Duboule, D.; Morata, G. Colinearity and functional hierarchy among genes of the homeotic complexes. Trends in genetics : TIG 1994, 10, 358-364. 61. Mallo, M.; Alonso, C.R. The regulation of hox gene expression during animal development. Development 2013, 140, 3951-3963. 62. Paro, R. Imprinting a determined state into the chromatin of drosophila. Trends in genetics : TIG 1990, 6, 416-421. 63. Gonzalez-Reyes, A.; Morata, G. The developmental effect of overexpressing a ubx product in drosophila embryos is dependent on its interactions with other homeotic products. Cell 1990, 61, 515-522. 64. Carapuco, M.; Novoa, A.; Bobola, N.; Mallo, M. Hox genes specify vertebral types in the presomitic mesoderm. Genes & development 2005, 19, 2116-2121. 65. Shiga, Y.; Sagawa, K.; Takai, R.; Sakaguchi, H.; Yamagata, H.; Hayashi, S. Transcriptional readthrough of hox genes ubx and antp and their divergent post-transcriptional control during crustacean evolution. Evol Dev 2006, 8, 407-414. 66. Simeone, A.; Pannese, M.; Acampora, D.; D'Esposito, M.; Boncinelli, E. At least three human homeoboxes on chromosome 12 belong to the same transcription unit. Nucleic acids research 1988, 16, 5379-5390. 67. Mainguy, G.; Koster, J.; Woltering, J.; Jansen, H.; Durston, A. Extensive polycistronism and antisense transcription in the mammalian hox clusters. PloS one 2007, 2, e356. 68. Pascual-Anaya, J.; D'Aniello, S.; Kuratani, S.; Garcia-Fernandez, J. Evolution of hox gene clusters in deuterostomes. BMC developmental biology 2013, 13, 26. 69. Mehta, T.K.; Ravi, V.; Yamasaki, S.; Lee, A.P.; Lian, M.M.; Tay, B.H.; Tohari, S.; Yanai, S.; Tay, A.; Brenner, S., et al. Evidence for at least six hox clusters in the japanese lamprey (lethenteron japonicum). Proceedings of the National Academy of Sciences of the United States of America 2013, 110, 16044-16049. 70. Gaunt, S.J.; Krumlauf, R.; Duboule, D. Mouse homeo-genes within a subfamily, hox-1.4, -2.6 and -5.1, display similar anteroposterior domains of expression in the embryo, but show stage- and tissue-dependent differences in their regulation. Development 1989, 107, 131-141. 71. Gaunt, S.J.; Sharpe, P.T.; Duboule, D. Spatially restricted domains of homeo-gene transcripts in mouse embryos - relation to a segmented body plan. Development Supplement 1988, 104, 169-179. 72. Gaunt, S.J.; Coletta, P.L.; Pravtcheva, D.; Sharpe, P.T. Mouse hox-3.4: Homeobox sequence and embryonic expression patterns compared with other members of the hox gene network. Development 1990, 109, 329-339. 73. Geada, A.M.; Gaunt, S.J.; Azzawi, M.; Shimeld, S.M.; Pearce, J.; Sharpe, P.T. Sequence and embryonic expression of the murine hox-3.5 gene. Development 1992, 116, 497-506. 74. Izpisua-Belmonte, J.C.; Dolle, P.; Renucci, A.; Zappavigna, V.; Falkenstein, H.; Duboule, D. Primary structure and embryonic expression pattern of the mouse hox-4.3 homeobox gene. Development 1990, 110, 733-745. 75. Soshnikova, N.; Dewaele, R.; Janvier, P.; Krumlauf, R.; Duboule, D. Duplications of hox gene clusters and the emergence of vertebrates. Developmental biology 2013, 378, 194-199. 76. Di-Poi, N.; Zakany, J.; Duboule, D. Distinct roles and regulations for hoxd genes in metanephric kidney development. PLoS genetics 2007, 3, e232. 77. Montavon, T.; Soshnikova, N.; Mascrez, B.; Joye, E.; Thevenet, L.; Splinter, E.; de Laat, W.; Spitz, F.; Duboule, D. A regulatory archipelago controls hox genes transcription in digits. Cell 2011, 147, 1132-1145. 78. Wagner, G.P.; Amemiya, C.; Ruddle, F. Hox cluster duplications and the opportunity for evolutionary novelties. Proceedings of the National Academy of Sciences of the United States of America 2003, 100, 14603-14606. 79. Smith, F.W.; Boothby, T.C.; Giovannini, I.; Rebecchi, L.; Jockusch, E.L.; Goldstein, B. The compact body plan of tardigrades evolved by the loss of a large body region. Current biology : CB 2016, 26, 224-229. 80. Moreno, E.; Nadal, M.; Baguna, J.; Martinez, P. Tracking the origins of the bilaterian hox patterning system: Insights from the acoel flatworm symsagittifera roscoffensis. Evol Dev 2009, 11, 574-581. 81. Hejnol, A.; Martindale, M.Q. Coordinated spatial and temporal expression of hox genes during embryogenesis in the acoel convolutriloba longifissura. BMC biology 2009, 7, 65. 82. Philippe, H.; Brinkmann, H.; Copley, R.R.; Moroz, L.L.; Nakano, H.; Poustka, A.J.; Wallberg, A.; Peterson, K.J.; Telford, M.J. Acoelomorph flatworms are deuterostomes related to xenoturbella. Nature 2011, 470, 255-258. 83. Philippe, H.; Poustka, A.J.; Chiodin, M.; Hoff, K.J.; Dessimoz, C.; Tomiczek, B.; Schiffer, P.H.; Muller, S.; Domman, D.; Horn, M., et al. Mitigating anticipated effects of systematic errors supports sister-group relationship between xenacoelomorpha and ambulacraria. Current biology : CB 2019, 29, 1818-1826 e1816. 84. Cannon, J.T.; Vellutini, B.C.; Smith, J., 3rd; Ronquist, F.; Jondelius, U.; Hejnol, A. Xenacoelomorpha is the sister group to nephrozoa. Nature 2016, 530, 89-93. 85. Castelli-Gair, J.E.; Capdevila, M.P.; Micol, J.L.; Garcia-Bellido, A. Positive and negative cis-regulatory elements in the bithoraxoid region of the drosophila ultrabithorax gene. Molecular & general genetics : MGG 1992, 234, 177-184. 86. Maeda, R.K.; Karch, F. The abc of the bx-c: The bithorax complex explained. Development 2006, 133, 1413-1422. 87. Horan, G.S.; Ramirez-Solis, R.; Featherstone, M.S.; Wolgemuth, D.J.; Bradley, A.; Behringer, R.R. Compound mutants for the paralogous hoxa-4, hoxb-4, and hoxd-4 genes show more complete homeotic transformations and a dose-dependent increase in the number of vertebrae transformed. Genes & development 1995, 9, 1667-1677. 88. McIntyre, D.C.; Rakshit, S.; Yallowitz, A.R.; Loken, L.; Jeannotte, L.; Capecchi, M.R.; Wellik, D.M. Hox patterning of the vertebrate rib cage. Development 2007, 134, 2981-2989. 89. Wellik, D.M.; Capecchi, M.R. Hox10 and hox11 genes are required to globally pattern the mammalian skeleton. Science 2003, 301, 363-367. 90. Sanchez-Herrero, E.; Crosby, M.A. The abdominal-b gene of drosophila melanogaster: Overlapping transcripts exhibit two different spatial distributions. The EMBO journal 1988, 7, 2163-2173. 91. Averof, M.; Akam, M. Hox genes and the diversification of insect and crustacean body plans. Nature 1995, 376, 420-423. 92. McCarthy-Taylor, J.B.; Kelly, S.R.; VanHook, A.M.; Marques-Souza, H.; Serano, J.M.; Patel, N.H. Expression of abdominal-b in the brine shrimp, artemia franciscana, expands our evolutionary understanding of the crustacean abdomen. Developmental biology 2022, 489, 178-184. 93. Gebelein, B.; Culi, J.; Ryoo, H.D.; Zhang, W.; Mann, R.S. Specificity of distalless repression and limb primordia development by abdominal hox proteins. Developmental cell 2002, 3, 487-498. 94. Abzhanov, A.; Kaufman, T.C. Crustacean (malacostracan) hox genes and the evolution of the arthropod trunk. Development 2000, 127, 2239-2249. 95. Averof, M.; Patel, N.H. Crustacean appendage evolution associated with changes in hox gene expression. Nature 1997, 388, 682-686. 96. Burke, A.C.; Nelson, C.E.; Morgan, B.A.; Tabin, C. Hox genes and the evolution of vertebrate axial morphology. Development 1995, 121, 333-346. 97. Gaunt, S.J. Conservation in the hox code during morphological evolution. The International journal of developmental biology 1994, 38, 549-552. 98. Vinagre, T.; Moncaut, N.; Carapuco, M.; Novoa, A.; Bom, J.; Mallo, M. Evidence for a myotomal hox/myf cascade governing nonautonomous control of rib specification within global vertebral domains. Developmental cell 2010, 18, 655-661. 99. Guerreiro, I.; Nunes, A.; Woltering, J.M.; Casaca, A.; Novoa, A.; Vinagre, T.; Hunter, M.E.; Duboule, D.; Mallo, M. Role of a polymorphism in a hox/pax-responsive enhancer in the evolution of the vertebrate spine. Proceedings of the National Academy of Sciences of the United States of America 2013, 110, 10682-10686. 100. Chiori, R.; Jager, M.; Denker, E.; Wincker, P.; Da Silva, C.; Le Guyader, H.; Manuel, M.; Queinnec, E. Are hox genes ancestrally involved in axial patterning? Evidence from the hydrozoan clytia hemisphaerica (cnidaria). PloS one 2009, 4, e4231. 101. DuBuc, T.Q.; Ryan, J.F.; Shinzato, C.; Satoh, N.; Martindale, M.Q. Coral comparative genomics reveal expanded hox cluster in the cnidarian-bilaterian ancestor. Integrative and comparative biology 2012, 52, 835-841. 102. DuBuc, T.Q.; Stephenson, T.B.; Rock, A.Q.; Martindale, M.Q. Hox and wnt pattern the primary body axis of an anthozoan cnidarian before gastrulation. Nature communications 2018, 9, 2007. 103. Kamm, K.; Schierwater, B.; Jakob, W.; Dellaporta, S.L.; Miller, D.J. Axial patterning and diversification in the cnidaria predate the hox system. Current biology : CB 2006, 16, 920-926. 104. Reddy, P.C.; Unni, M.K.; Gungi, A.; Agarwal, P.; Galande, S. Evolution of hox-like genes in cnidaria: Study of hydra hox repertoire reveals tailor-made hox-code for cnidarians. Mechanisms of development 2015, 138 Pt 2, 87-96. J. Dev. Biol. 2022, 10, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/jdb image2.emf 12345678910111213 3′5′ DrosophilaLast commonancestorMouseHoxaHoxbHoxcHoxd ANT-C labpbDfdScrAntpUbxAbd-Babd-A BX-C zen2zenbcdftz image3.emf HeadTailHox gene expressiondomains along thehead-tail axisHox gene clustershowing gene activitiesat different body positionsHeadTail AB 123456123456 Active or expressibleHox geneInactive or non-expressibleHox gene 123456123456HoxHoxHoxHox 3′5′3′5′ image4.emf Hox1Hox13 Hoxa, Hoxc, Hoxd openclosed Hoxb Hox1Hox13 image5.emf PD ancestor(570 MYA)XenacoelomorphaUr-bilateria(600-615 MYA)CnidariaCn-B LCA(600-630 MYA)PoriferaChoano-flagellataUr-metazoanPlantsCh-M LCA Originof metazoan-specific homeobox gene classese.g.,ANTP, PRD, POU, LIMPossible origins of Hox geneOrigin of Hox collinearity3 Hox genes Ur-eukaryotaProkaryotesLUCA Origin of first homeobox gene. 7+ Hox gene cluster DeuterostomeFungi Independent proliferationof homeobox genes withadvent of multicellularity ProtostomeHemichordateChordate image6.emf 01000200030004000 HoxaHoxbHoxcHoxdAmphioxus (cephalochordate)Sea urchin (echinoderm)Tribolium(beetle)Apis(bee)Nasonia(wasp)Ixodes(tick)Mouse (vertebrate) DNA base pairs (kb) Anopheles(mosquito)Saccoglossus(hemichordate)Parhyale(crustacean) image7.emf Abd-BKO feeding appendagesclawsforwardreversewalking legsswimmeretsuropodsantennae AntpKOUbxKO ABCE abd-AKO D image8.emf 17821234569101112131415161718192022232425262728293031323334 CervicalvertebraeThoracicvertebraeLumbarvertebraeSacralvert.Caudal vertebraeSkull d12d11a10, d10a9a7a6a5a4a3a2 image9.emf 2Hox6543154Hox6321HeadTail524Hox63123Hox14565Hox123462Hox65431HeadTailHeadTail 3′5′3′5′3′5′ Hox gene expressiondomains along thehead-tail axisHox gene clustershowing gene activitiesat different body positions Active or expressibleHox geneInactive or non-expressibleHox gene ABC image10.emf A cervicalVertebrae1 to 7sternum1 st thoracicvertebrafore-limbskull B Normal Hox4 KO image11.emf NormalHox11 KOHox9 KOHox10 KO ABCD 1 st lumbar vertebra1 st sacralvertebrahind-limb1 st caudalvertebracervicalVertebrae1 to 7sternum1 st thoracicvertebrafore-limbskull image12.emf AntpUbxabd-AAbd-B InsectBranchiopoda(e.g. Artemia)Crustacean Antp, Ubx, abd-AAbd-B Hox gene duplication Arthropod ancestor AntpUbxabd-AAbd-BAntpUbxabd-AAbd-B (e.g. Drosophila)Malacostra(e.g. Parhyale) AntpUbxabd-AAbd-B TATATA image1.png image13.png image14.png