[Yt-svn] yt: 6 new changesets

hg at spacepope.org
Thu Jan 13 08:08:27 PST 2011


hg Repository: yt
details:   yt/rev/52d72f7ad064
changeset: 3641:52d72f7ad064
user:      Matthew Turk <matthewturk at gmail.com>
date:      Sun Jan 09 00:03:27 2011 -0500
description:
Adding initial pass of star rendering.  The code has not been re-cythonized.
This uses a new, very simple kD-tree from kdtree.googlecode.com.

Several shortcomings:

  1) Quality is not very high
  2) All stars are white, all of uniform radius
  3) The rendering is quite slow
  4) The adaptive integration stepsize seems to be responsible for the odd artifacts.

It's a bit of a proof of concept that doesn't quite work out just yet.
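As a rough sketch of the approach this changeset describes (star particles go into a kD-tree, and the renderer runs fixed-radius ball queries against it during integration), here is a minimal pure-Python stand-in. The class name and structure are illustrative only; the actual changeset uses the C kD-tree from kdtree.googlecode.com, not this:

```python
class SimpleKDTree:
    # Minimal kD-tree with a fixed-radius ball query -- an illustrative
    # stand-in for the small C tree (kdtree.googlecode.com) named above.
    def __init__(self, points, depth=0):
        k = len(points[0])
        self.axis = depth % k
        points = sorted(points, key=lambda p: p[self.axis])
        mid = len(points) // 2
        self.point = points[mid]
        self.left = SimpleKDTree(points[:mid], depth + 1) if mid > 0 else None
        self.right = (SimpleKDTree(points[mid + 1:], depth + 1)
                      if mid + 1 < len(points) else None)

    def ball_query(self, x, radius, found=None):
        # Collect every stored point within `radius` of the sample point `x`.
        if found is None:
            found = []
        if sum((a - b) ** 2 for a, b in zip(self.point, x)) ** 0.5 <= radius:
            found.append(self.point)
        d = x[self.axis] - self.point[self.axis]
        near, far = (self.left, self.right) if d < 0 else (self.right, self.left)
        if near is not None:
            near.ball_query(x, radius, found)
        # Only cross the splitting plane when the ball can reach the far side.
        if far is not None and abs(d) <= radius:
            far.ball_query(x, radius, found)
        return found
```

A renderer would run one such query per ray sample; the quality and speed shortcomings listed above would apply to any implementation this naive.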

hg Repository: yt
details:   yt/rev/c2bd5ac0db01
changeset: 3642:c2bd5ac0db01
user:      Matthew Turk <matthewturk at gmail.com>
date:      Sun Jan 09 21:27:21 2011 -0500
description:
Clear up the naming scheme for the sigma quantity.

hg Repository: yt
details:   yt/rev/36631ebdc090
changeset: 3643:36631ebdc090
user:      Matthew Turk <matthewturk at gmail.com>
date:      Wed Jan 12 18:33:22 2011 -0500
description:
Adding in (mandatory) color support for particles.  The artifact at grid
boundaries is still present, but only appears when compiled with O3, not O2.  I
am looking into it.
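A hedged sketch of what per-particle color support amounts to: mapping a per-particle scalar onto RGB triples. The function name, endpoint colors, and the simple linear two-color ramp here are assumptions for illustration, not yt's actual API:

```python
def particle_colors(values, lo_color=(0.2, 0.2, 1.0), hi_color=(1.0, 1.0, 1.0)):
    # Map each per-particle scalar in `values` onto an RGB triple by linear
    # interpolation between two endpoint colors.  Purely illustrative: the
    # name, endpoints, and ramp are hypothetical, not what the changeset adds.
    vmin, vmax = min(values), max(values)
    span = (vmax - vmin) or 1.0  # avoid division by zero for uniform input
    colors = []
    for v in values:
        t = (v - vmin) / span
        colors.append(tuple(lo + t * (hi - lo)
                            for lo, hi in zip(lo_color, hi_color)))
    return colors
```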

hg Repository: yt
details:   yt/rev/e1651647403a
changeset: 3644:e1651647403a
user:      Matthew Turk <matthewturk at gmail.com>
date:      Thu Jan 13 00:33:29 2011 -0500
description:
Removing an aliasing issue that seemed to show up as a huge grid artifact in O3
(or O2 with the full GCSE optimizations on).

hg Repository: yt
details:   yt/rev/8271bb094a2a
changeset: 3645:8271bb094a2a
user:      Sam Skillman <sam.skillman at gmail.com>
date:      Thu Jan 13 08:54:43 2011 -0700
description:
Modifying amr_kdtree to work with star particles, which involved adding an
AMRKDTree.bricks object.

hg Repository: yt
details:   yt/rev/8bc321156516
changeset: 3646:8bc321156516
user:      Matthew Turk <matthewturk at gmail.com>
date:      Thu Jan 13 11:08:18 2011 -0500
description:
Merging

diffstat:

 doc/docs_html.zip                                           |    0 
 yt/analysis_modules/halo_finding/halo_objects.py            |   27 +-
 yt/analysis_modules/halo_merger_tree/api.py                 |    3 +
 yt/analysis_modules/halo_merger_tree/enzofof_merger_tree.py |  215 +++
 yt/data_objects/data_containers.py                          |    3 +-
 yt/data_objects/universal_fields.py                         |   17 +-
 yt/utilities/_amr_utils/VolumeIntegrator.pyx                |   86 +-
 yt/utilities/_amr_utils/kdtree.c                            |  761 +++++++++++
 yt/utilities/_amr_utils/kdtree.h                            |  114 +
 yt/utilities/_amr_utils/kdtree_utils.pxd                    |   52 +
 yt/utilities/amr_kdtree/amr_kdtree.py                       |    4 +-
 yt/utilities/kd.py                                          |  240 ---
 yt/utilities/pykdtree.py                                    |  823 ++++++++++++
 yt/utilities/setup.py                                       |    4 +-
 14 files changed, 2080 insertions(+), 269 deletions(-)

diffs (truncated from 2547 to 300 lines):

diff -r 6570505f0d17 -r 8bc321156516 doc/docs_html.zip
Binary file doc/docs_html.zip has changed
diff -r 6570505f0d17 -r 8bc321156516 yt/analysis_modules/halo_finding/halo_objects.py
--- a/yt/analysis_modules/halo_finding/halo_objects.py	Thu Jan 06 15:25:48 2011 -0600
+++ b/yt/analysis_modules/halo_finding/halo_objects.py	Thu Jan 13 11:08:18 2011 -0500
@@ -1505,7 +1505,8 @@
         
         Parameters
         ----------
-        pf : EnzoStaticOutput object
+        pf : `StaticOutput`
+            The parameter file on which halo finding will be conducted.
         threshold : float
             The density threshold used when building halos. Default = 160.0.
         dm_only : bool
@@ -1779,10 +1780,12 @@
         
         Parameters
         ----------
-        pf : EnzoStaticOutput object
-        subvolume : A region over which HOP will be run, which can be used
-            to run HOP on a subvolume of the full volume. Default = None,
-            which defaults to the full volume automatically.
+        pf : `StaticOutput`
+            The parameter file on which halo finding will be conducted.
+        subvolume : `yt.data_objects.api.AMRData`, optional
+            A region over which HOP will be run, which can be used to run HOP
+            on a subvolume of the full volume. Default = None, which defaults
+            to the full volume automatically.
         threshold : float
             The density threshold used when building halos. Default = 160.0.
         dm_only : bool
@@ -1795,7 +1798,7 @@
             in code units. Default = 0.02.
         
         Examples
-        -------
+        --------
         >>> pf = load("RedshiftOutput0000")
         >>> halos = HaloFinder(pf)
         """
@@ -1857,10 +1860,12 @@
         
         Parameters
         ----------
-        pf : EnzoStaticOutput object
-        subvolume : A region over which FOF will be run, which can be used
-            to run FOF on a subvolume of the full volume. Default = None,
-            which defaults to the full volume automatically.
+        pf : `StaticOutput`
+            The parameter file on which halo finding will be conducted.
+        subvolume : `yt.data_objects.api.AMRData`, optional
+            A region over which FOF will be run, which can be used to run FOF
+            on a subvolume of the full volume. Default = None, which defaults
+            to the full volume automatically.
         link : float
             If positive, the interparticle distance (compared to the overall
             average) used to build the halos. If negative, this is taken to be
@@ -1876,7 +1881,7 @@
             in code units. Default = 0.02.
         
         Examples
-        -------
+        --------
         >>> pf = load("RedshiftOutput0000")
         >>> halos = FOFHaloFinder(pf)
         """
diff -r 6570505f0d17 -r 8bc321156516 yt/analysis_modules/halo_merger_tree/api.py
--- a/yt/analysis_modules/halo_merger_tree/api.py	Thu Jan 06 15:25:48 2011 -0600
+++ b/yt/analysis_modules/halo_merger_tree/api.py	Thu Jan 13 11:08:18 2011 -0500
@@ -36,3 +36,6 @@
     Link, \
     MergerTreeDotOutput, \
     MergerTreeTextOutput
+
+from .enzofof_merger_tree import \
+    find_halo_relationships
diff -r 6570505f0d17 -r 8bc321156516 yt/analysis_modules/halo_merger_tree/enzofof_merger_tree.py
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/yt/analysis_modules/halo_merger_tree/enzofof_merger_tree.py	Thu Jan 13 11:08:18 2011 -0500
@@ -0,0 +1,215 @@
+"""
+A very simple, purely-serial, merger tree script that knows how to parse FOF
+catalogs output by Enzo and then compare parent/child relationships.
+
+Author: Matthew J. Turk <matthewturk at gmail.com>
+Affiliation: NSF / Columbia
+Homepage: http://yt.enzotools.org/
+License:
+  Copyright (C) 2010-2011 Matthew Turk.  All Rights Reserved.
+
+  This file is part of yt.
+
+  yt is free software; you can redistribute it and/or modify
+  it under the terms of the GNU General Public License as published by
+  the Free Software Foundation; either version 3 of the License, or
+  (at your option) any later version.
+  
+  This program is distributed in the hope that it will be useful,
+  but WITHOUT ANY WARRANTY; without even the implied warranty of
+  MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+  GNU General Public License for more details.
+  
+  You should have received a copy of the GNU General Public License
+  along with this program.  If not, see <http://www.gnu.org/licenses/>.
+"""
+
+# First pass at a simplified merger tree
+#
+# Basic outline:
+#
+# 1. Halo find inline, obtaining particle catalogs
+# 2. Load dataset at time t
+# 3. Load dataset at time t+1
+# 4. Parse catalogs for t and t+1
+# 5. Place halos for t+1 in kD-tree
+# 6. For every halo in t, execute ball-query with some linking length
+# 7. For every halo in ball-query result, execute numpy's intersect1d on
+#    particle IDs
+# 8. Parentage is described by a fraction of particles that pass from one to
+#    the other; we have both descendant fractions and ancestry fractions.
+
+import numpy as na
+import h5py
+import time
+import pdb
+import cPickle
+
+from yt.funcs import *
+from yt.utilities.pykdtree import KDTree
+
+# We don't currently use this, but we may again find a use for it in the
+# future.
+class MaxLengthDict(dict):
+    def __init__(self, *args, **kwargs):
+        dict.__init__(self, *args, **kwargs)
+        self.order = [None] * 50
+
+    def __setitem__(self, key, val):
+        if key not in self.order:
+            to_remove = self.order.pop(0)
+            self.pop(to_remove, None)
+        self.order.append(key)
+        dict.__setitem__(self, key, val)
+
+    def __getitem__(self, key):
+        if key in self.order:
+            self.order.pop(self.order.index(key))
+            self.order.append(key)
+        return dict.__getitem__(self, key)
+
+    def __delitem__(self, key):
+        dict.__delitem__(self, key)
+        self.order.pop(self.order.index(key))
+        self.order.insert(0, None)
+
+class HaloCatalog(object):
+    cache = None
+    def __init__(self, output_id, cache = True):
+        r"""A catalog of halos, parsed from EnzoFOF outputs.
+
+        This class will read in catalogs output by the Enzo FOF halo finder and
+        make available their positions, radii, etc.  Enzo FOF was provided
+        starting with 2.0, and can be run either inline (with the correct
+        options) or as a postprocessing step using the `-F` command line
+        option.  This class is mostly useful when calculating a merger tree,
+        and when the particle IDs for members of a given halo are output as
+        well.
+
+        Parameters
+        ----------
+        output_id : int
+            This is the integer output id of the halo catalog to parse and
+            load.
+        cache : bool
+            Should we store, in between accesses, the particle IDs?  If set to
+            true, the correct particle files must exist.
+        """
+        self.output_id = output_id
+        self.particle_file = h5py.File("FOF/particles_%04i.h5" % output_id, "r")
+        self.parse_halo_catalog()
+        if cache: self.cache = dict()#MaxLengthDict()
+
+    def parse_halo_catalog(self):
+        hp = []
+        for line in open("FOF/groups_%04i.dat" % self.output_id):
+            if line.strip() == "": continue # empty
+            if line[0] == "#": continue # comment
+            if line[0] == "d": continue # datavar
+            x,y,z = [float(f) for f in line.split(None, 3)[:-1]]
+            hp.append([x,y,z])
+        self.halo_positions = na.array(hp)
+        self.halo_kdtree = KDTree(self.halo_positions)
+        return hp
+
+    def read_particle_ids(self, halo_id):
+        if self.cache is not None:
+            if halo_id not in self.cache:
+                self.cache[halo_id] = self.particle_file["/Halo%08i/Particle ID" % halo_id][:]
+            ids = self.cache[halo_id]
+        else:
+            ids = self.particle_file["/Halo%08i/Particle ID" % halo_id][:]
+        return HaloParticleList(halo_id, self.halo_positions[halo_id,:], ids)
+
+    def calculate_parentage_fractions(self, other_catalog, radius = 0.10):
+        parentage_fractions = {}
+        mylog.debug("Ball-tree query with radius %0.3e", radius)
+        all_nearest = self.halo_kdtree.query_ball_tree(
+            other_catalog.halo_kdtree, radius)
+        pbar = get_pbar("Halo Mergers", self.halo_positions.shape[0])
+        for hid1, nearest in enumerate(all_nearest):
+            pbar.update(hid1)
+            parentage_fractions[hid1] = {}
+            HPL1 = self.read_particle_ids(hid1)
+            for hid2 in nearest:
+                HPL2 = other_catalog.read_particle_ids(hid2)
+                p1, p2 = HPL1.find_relative_parentage(HPL2)
+                parentage_fractions[hid1][hid2] = (p1, p2)
+        pbar.finish()
+        return parentage_fractions
+
+class HaloParticleList(object):
+    def __init__(self, halo_id, position, particle_ids):
+        self.halo_id = halo_id
+        self.position = na.array(position)
+        self.particle_ids = particle_ids
+
+    def find_nearest(self, other_tree, radius = 0.10):
+        return other_tree.query_ball_point(self.position, radius)
+
+    def find_relative_parentage(self, child):
+        # Return two values: percent this halo gave to the other, and percent
+        # of the other that comes from this halo
+        overlap = na.intersect1d(self.particle_ids, child.particle_ids).size
+        of_child_from_me = float(overlap)/child.particle_ids.size
+        of_mine_from_me = float(overlap)/self.particle_ids.size
+        return of_child_from_me, of_mine_from_me
+
+def find_halo_relationships(output1_id, output2_id, output_basename = None,
+                            radius = 0.10):
+    r"""Calculate the parentage and child relationships between two EnzoFOF
+    halo catalogs.
+
+    This function performs a very simple merger tree calculation between two
+    sets of halos.  For every halo in the second halo catalog, it looks to the
+    first halo catalog to find the parents by looking at particle IDs.  The
+    particle IDs from the child halos are identified in potential parents, and
+    then both percent-of-parent and percent-to-child values are recorded.
+
+    Note that this works only with catalogs constructed by Enzo's FOF halo
+    finder, not with catalogs constructed by yt.
+
+    Parameters
+    ----------
+    output1_id : int
+        This is the integer output id of the (first) halo catalog to parse and
+        load.
+    output2_id : int
+        This is the integer output id of the (second) halo catalog to parse and
+        load.
+    output_basename : string
+        If provided, both .cpkl and .txt files containing the parentage
+        relationships will be output.
+    radius : float, default to 0.10
+        In absolute units, the radius to examine when guessing possible
+        parent/child relationships.  If this value is too small, you will miss
+        possible relationships.
+
+    Returns
+    -------
+    pfrac : dict
+        This is a dict of dicts.  The first key is the parent halo id, the
+        second is the child halo id.  The values are the percent contributed
+        from parent to child and the percent of a child that came from the
+        parent.
+    """
+    mylog.info("Parsing Halo Catalog %04i", output1_id)
+    HC1 = HaloCatalog(output1_id, False)
+    mylog.info("Parsing Halo Catalog %04i", output2_id)
+    HC2 = HaloCatalog(output2_id, True)
+    mylog.info("Calculating fractions")
+    pfrac = HC1.calculate_parentage_fractions(HC2)
+
+    if output_basename is not None:
+        f = open("%s.txt" % (output_basename), "w")
+        for hid1 in sorted(pfrac):
+            for hid2 in sorted(pfrac[hid1]):
+                p1, p2 = pfrac[hid1][hid2]
+                if p1 == 0.0: continue
+                f.write( "Halo %s (%s) contributed %0.3e of its particles to %s (%s), which makes up %0.3e of that halo\n" % (
+                            hid1, output1_id, p2, hid2, output2_id, p1))
+        f.close()
+
+        cPickle.dump(pfrac, open("%s.cpkl" % (output_basename), "wb"))
+
+    return pfrac
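The parent/child bookkeeping in find_halo_relationships above boils down to one ID intersection per halo pair. A minimal standalone sketch (a hypothetical helper, not part of the patch; it uses Python sets where the patch uses na.intersect1d):

```python
def parentage_fractions(parent_ids, child_ids):
    # Fraction of the child built from the parent, and fraction of the parent
    # that ends up in the child -- mirroring what
    # HaloParticleList.find_relative_parentage computes with na.intersect1d.
    # Hypothetical standalone helper for illustration only.
    overlap = len(set(parent_ids) & set(child_ids))
    return overlap / len(child_ids), overlap / len(parent_ids)
```

Returning the pair in that order matches find_relative_parentage, which reports the child's fraction first.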
diff -r 6570505f0d17 -r 8bc321156516 yt/data_objects/data_containers.py
--- a/yt/data_objects/data_containers.py	Thu Jan 06 15:25:48 2011 -0600
+++ b/yt/data_objects/data_containers.py	Thu Jan 13 11:08:18 2011 -0500
@@ -31,6 +31,7 @@



More information about the yt-svn mailing list