
Problems implementing the "Wave Collapse Function" algorithm in Python 2.7

  • solub  ·  5 years ago

    In a nutshell:

    My implementation of the Wave Collapse Function algorithm in Python 2.7 is flawed and I can't pinpoint where the problem lies. I would appreciate help figuring out what I'm missing or doing wrong.

    What is the Wave Collapse Function?

    It is an algorithm written in 2016 by Maxim Gumin that can generate procedural patterns from a sample image. You can see it in action here (2D overlapping model) and here (3D tile model).

    The goal of this implementation is to boil the algorithm (2D overlapping model) down to its essence and avoid the verbosity of the original C# script (which is surprisingly long and hard to read). This is an attempt to make a shorter, clearer and more pythonic version of the algorithm.

    I'm using Processing (Python mode), a software for visual design that makes image manipulation easier (no PIL, no Matplotlib, ...). The main drawback is that I'm limited to Python 2.7 and cannot import numpy.

    Unlike the original version, this implementation:

    • uses 1D arrays (lists) instead of 2D arrays (see the index-mapping sketch below)
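
    Since everything is stored flat, a cell (x, y) of the d-by-d output maps to the index x + y*d, which is how expressions like emin%d and emin/d in the script below should be read. A tiny standalone sketch (the variable names are mine):

    d = 20
    x, y = 7, 3
    idx = x + y * d            # flat index: 67
    print idx % d, idx // d    # back to 2D coordinates, prints: 7 3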

    The algorithm (as I understand it)

    1/ Read the input bitmap, store every NxN pattern and count its occurrences. (Optional: augment the pattern data with rotations and reflections.)

    [image: NxN patterns extracted from the input image]

    2/ Precompute and store every possible adjacency relation between patterns. In the example below, patterns 207, 242, 182 and 125 can overlap the right side of pattern 246:

    [image: patterns 207, 242, 182 and 125 overlapping the right side of pattern 246]
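
    To make the adjacency test concrete, here is a minimal standalone sketch of the check for the "right" direction; the compatible() function in the script below implements the same idea, and the helper name and the two example patterns are mine:

    N = 3

    def overlaps_right(p_left, p_right):
        # True if p_right can be placed on the right side of p_left, i.e. the last two
        # columns of p_left equal the first two columns of p_right
        # (patterns are flattened NxN tuples, as in the script below)
        return all(p_left[i] == p_right[i-1] for i in xrange(N*N) if i % N != 0)

    a = (1, 2, 3,  4, 5, 6,  7, 8, 9)
    b = (2, 3, 0,  5, 6, 0,  8, 9, 0)   # its first two columns match the last two columns of a
    print overlaps_right(a, b)          # prints: True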

    3/ Create an array with the dimensions of the output (called W, for "wave"). Each element of this array is itself an array holding the state (True or False) of each pattern.

    For example, say we count 326 unique patterns in the input and we want our output to be of dimensions 20 by 20 (400 cells). Then the "Wave" array will contain 400 (20x20) arrays, each of them containing 326 boolean values.

    At start, all booleans are set to True.

    W = [[True for pattern in xrange(len(patterns))] for cell in xrange(20*20)]
    

    4/ Create another array with the dimensions of the output (called H). Each element of this array is a float holding the "entropy" value of the corresponding cell in the output.

    Entropy here refers to Shannon Entropy and is computed from the patterns that are still valid at a specific location in the Wave: the more patterns a cell holds as True, the higher its entropy.

    For example, to compute the entropy of cell 22 we look at its corresponding index in the Wave (W[22]), gather the patterns still set to True there and compute the Shannon entropy from their frequencies. The result is then stored in H at the same index (H[22]).
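
    As a concrete illustration, here is a standalone sketch of that computation (the helper name is mine; the script below inlines the same formula):

    import math

    def shannon_entropy(bools, freqs):
        # entropy of one cell, given the True/False states of its Wave entry
        # and the occurrence counts of all unique patterns
        lfreqs = [f for b, f in zip(bools, freqs) if b]   # counts of the still-valid patterns
        total = float(sum(lfreqs))
        return math.log(total) - sum(f * math.log(f) for f in lfreqs) / total

    # a cell where 3 patterns (seen 1, 2 and 4 times in the input) are still valid:
    print shannon_entropy([True, True, False, True], [1, 2, 5, 4])   # ~0.956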

    At start, every pattern is set to True for every cell, so H holds the same (maximal) entropy value for each cell:

    H = [entropyValue for cell in xrange(20*20)]
    

    The core of the algorithm

    5/ Observation:

    Find the index of the cell with the minimum nonzero entropy. (Note that at the very first iteration all entropies are equal, so the index of a cell has to be picked randomly.)

    Then, look at the patterns that are still valid at the corresponding index in the Wave and select one of them randomly, weighted by how frequently that pattern appears in the input image (weighted choice).

    For example, if the lowest value in H is at index 22 (H[22]), we look at all the patterns set to True in W[22] and pick one of them randomly, weighted by its frequency in the input.
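
    A standalone sketch of that weighted choice (the helper name is mine; the script below hard-codes the same logic with a cumulative sum and Processing's random()):

    from random import random   # stand-in for Processing's random(1)

    def weighted_pick(bools, freqs):
        # index of one still-valid pattern, chosen with probability proportional
        # to how often that pattern occurs in the input
        lfreqs = [b * f for b, f in zip(bools, freqs)]   # invalid patterns get weight 0
        r, acc = random() * sum(lfreqs), 0.0
        for i, f in enumerate(lfreqs):
            acc += f
            if r < acc:
                return i

    # pattern 2 (weight 5) comes up far more often than pattern 0 (weight 1):
    print weighted_pick([True, False, True, True], [1, 2, 5, 4])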

    6/

    We now assign the index of the selected pattern to the cell with the minimum entropy. That is, every pattern at the corresponding location in the Wave is set to False, except for the one that has been chosen.

    For example, if pattern 246 in W[22] was set to True and has been selected, then every other pattern is set to False. Cell 22 is assigned pattern 246, and in the output cell 22 will be filled with the first color (top-left corner) of pattern 246 (blue in this example).

    7/

    Because of the adjacency constraints, this pattern selection has consequences on the neighboring cells in the Wave. The boolean arrays corresponding to the cells on the left, right, above and below the recently collapsed cell need to be updated accordingly.

    For example, if cell 22 has collapsed and has been assigned pattern 246, then W[21] (left), W[23] (right), W[2] (up) and W[42] (down) have to be modified so that they only keep to True the patterns that are adjacent to pattern 246.

    For example, looking back at the picture of step 2, we can see that only patterns 207, 242, 182 and 125 can be placed to the right of pattern 246. That means W[23] (right of cell 22) needs to keep patterns 207, 242, 182 and 125 as True and set every other pattern in the array to False. If none of these patterns is still valid there (already set to False because of a previous constraint), then the algorithm faces a contradiction.
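
    As an illustration, here is a minimal sketch of that update for a single neighboring cell (hypothetical helper, not part of the script below):

    def constrain_neighbor(neighbor_bools, allowed_next_to_chosen):
        # keep True only the patterns of the neighboring cell that are allowed
        # next to the freshly collapsed pattern; an empty result is a contradiction
        updated = [b and (i in allowed_next_to_chosen) for i, b in enumerate(neighbor_bools)]
        if not any(updated):
            print 'contradiction'
        return updated

    # only patterns 1 and 3 may sit at this location next to the chosen pattern:
    print constrain_neighbor([True, True, False, True], {1, 3})   # prints: [False, True, False, True]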

    8/

    Because a cell has collapsed (one pattern selected, set to True) and its surrounding cells have been updated accordingly (non-adjacent patterns set to False), the entropy of all these cells has changed and needs to be re-computed. (Remember that the entropy of a cell correlates with the number of valid patterns it holds in the Wave.)

    In the example, the entropy of cell 22 is now 0 (H[22] = 0) since only pattern 246 is kept True in W[22]. The entropies of its neighboring cells have decreased as well (the patterns that are not adjacent to pattern 246 have been set to False).

    The algorithm now arrives at the end of the first iteration and will loop over steps 5 (find the cell with minimum nonzero entropy) to 8 (update entropies) until all cells are collapsed.

    You'll need Processing with Python mode installed to run this script. You'll also need to download the input image and change the path on line 16 accordingly.

    from collections import Counter
    from itertools import chain, izip
    import math
    
    d = 20  # dimensions of output (array of dxd cells)
    N = 3 # dimensions of a pattern (NxN matrix)
    
    Output = [120 for i in xrange(d*d)] # array holding the color value for each cell in the output (at start each cell is grey = 120)
    
    def setup():
        size(800, 800, P2D)
        textSize(11)
    
        global W, H, A, freqs, patterns, directions, xs, ys, npat
    
        img = loadImage('Flowers.png') # path to the input image
        iw, ih = img.width, img.height # dimensions of input image
        xs, ys = width//d, height//d # dimensions of cells (squares) in output
        kernel = [[i + n*iw for i in xrange(N)] for n in xrange(N)] # NxN matrix of pixel indices used to read every NxN pattern contained in the input image
        directions = [(-1, 0), (1, 0), (0, -1), (0, 1)] # (x, y) tuples to access the 4 neighboring cells of a collapsed cell
        all = [] # array list to store all the patterns found in input
    
    
    
        # Stores the different patterns found in input
        for y in xrange(ih):
            for x in xrange(iw):
    
                ''' The one-liner below (cmat) creates a NxN matrix with (x, y) being its top left corner.
                    This matrix will wrap around the edges of the input image.
                    The whole snippet reads every NxN part of the input image and store the associated colors.
                    Each NxN part is called a 'pattern' (of colors). Each pattern can be rotated or flipped (not mandatory). '''
    
    
                cmat = [[img.pixels[((x+n)%iw)+(((a[0]+iw*y)/iw)%ih)*iw] for n in a] for a in kernel]
    
                # Storing rotated patterns (90°, 180°, 270°, 360°) 
                for r in xrange(4):
                    cmat = zip(*cmat[::-1]) # +90° rotation
                    all.append(cmat) 
    
                # Storing reflected patterns (vertical/horizontal flip)
                all.append(cmat[::-1])
                all.append([a[::-1] for a in cmat])
    
    
    
    
        # Flatten pattern matrices + count occurrences
    
        ''' Once every pattern has been stored,
            - we flatten them (convert to 1D) for convenience
            - count the number of occurrences of each one of them (one pattern can be found multiple times in input)
            - select unique patterns only
            - store them from less common to most common (needed for weighted choice)'''
    
        all = [tuple(chain.from_iterable(p)) for p in all] # flatten pattern matrices (NxN --> 1D)
        c = Counter(all)
        freqs = sorted(c.values()) # number of occurrences for each unique pattern, in sorted order
        npat = len(freqs) # number of unique patterns
        total = sum(freqs) # sum of frequencies of unique patterns
        patterns = [p[0] for p in c.most_common()[:-npat-1:-1]] # list of unique patterns sorted from less common to most common
    
    
    
        # Computes entropy
    
        ''' The entropy of a cell is correlated to the number of possible patterns that cell holds.
            The more a cell has valid patterns (set to 'True'), the higher its entropy is.
            At start, every pattern is set to 'True' for each cell. So each cell holds the same high entropy value'''
    
        ent = math.log(total) - sum(map(lambda x: x * math.log(x), freqs)) / total
    
    
    
        # Initializes the 'wave' (W), entropy (H) and adjacencies (A) array lists
    
        W = [[True for _ in xrange(npat)] for i in xrange(d*d)] # every pattern is set to 'True' at start, for each cell
        H = [ent for i in xrange(d*d)] # same entropy for each cell at start (every pattern is valid)
        A = [[set() for dir in xrange(len(directions))] for i in xrange(npat)] #see below for explanation
    
    
    
    
        # Compute patterns compatibilities (check if some patterns are adjacent, if so -> store them based on their location)
    
        ''' EXAMPLE:
    If pattern index 42 can be placed to the right of pattern index 120,
        we will store this adjacency rule as follow:
    
                         A[120][1].add(42)
    
        Here '1' stands for 'right' or 'East'/'E'
    
        0 = left or West/W
        1 = right or East/E
        2 = up or North/N
        3 = down or South/S '''
    
        # Comparing patterns to each other
        for i1 in xrange(npat):
            for i2 in xrange(npat):
                for dir in (0, 2):
                    if compatible(patterns[i1], patterns[i2], dir):
                        A[i1][dir].add(i2)
                        A[i2][dir+1].add(i1)
    
    
    def compatible(p1, p2, dir):
    
        '''NOTE: 
    what is referred to as 'columns' and 'rows' here below are not really columns and rows 
        since we are dealing with 1D patterns. Remember here N = 3'''
    
        # If the first two columns of pattern 1 == the last two columns of pattern 2 
        # --> pattern 2 can be placed to the left (0) of pattern 1
        if dir == 0:
            return [n for i, n in enumerate(p1) if i%N!=2] == [n for i, n in enumerate(p2) if i%N!=0]
    
        # If the first two rows of pattern 1 == the last two rows of pattern 2
        # --> pattern 2 can be placed on top (2) of pattern 1
        if dir == 2:
            return p1[:6] == p2[-6:]
    
    
    
    def draw():    # Equivalent of a 'while' loop in Processing (all the code below will be looped over and over until all cells are collapsed)
        global H, W, grid
    
        ### OBSERVATION
        # Find cell with minimum non-zero entropy (not collapsed yet)
    
        '''Randomly select 1 cell at the first iteration (when all entropies are equal), 
           otherwise select cell with minimum non-zero entropy'''
    
        emin = int(random(d*d)) if frameCount <= 1 else H.index(min(H)) 
    
    
    
        # Stopping mechanism
    
        ''' When 'H' array is full of 'collapsed' cells --> stop iteration '''
    
        if H[emin] == 'CONT' or H[emin] == 'collapsed': 
            print 'stopped'
            noLoop()
            return
    
    
    
        ### COLLAPSE
        # Weighted choice of a pattern
    
        ''' Among the patterns available in the selected cell (the one with min entropy), 
            select one pattern randomly, weighted by the frequency that pattern appears in the input image.
            With Python 2.7 no possibility to use random.choice(x, weight) so we have to hard code the weighted choice '''
    
        lfreqs = [b * freqs[i] for i, b in enumerate(W[emin])] # frequencies of the patterns available in the selected cell
        weights = [float(f) / sum(lfreqs) for f in lfreqs] # normalizing these frequencies
        cumsum = [sum(weights[:i]) for i in xrange(1, len(weights)+1)] # cumulative sums of normalized frequencies
        r = random(1)
        idP = sum([cs < r for cs in cumsum])  # index of selected pattern 
    
        # Set all patterns to False except for the one that has been chosen   
        W[emin] = [0 if i != idP else 1 for i, b in enumerate(W[emin])]
    
        # Marking selected cell as 'collapsed' in H (array of entropies)
        H[emin] = 'collapsed' 
    
        # Storing first color (top left corner) of the selected pattern at the location of the collapsed cell
        Output[emin] = patterns[idP][0]
    
    
    
        ### PROPAGATION
        # For each neighbor (left, right, up, down) of the recently collapsed cell
        for dir, t in enumerate(directions):
            x = (emin%d + t[0])%d
            y = (emin/d + t[1])%d
            idN = x + y * d #index of neighbor
    
            # If that neighbor hasn't been collapsed yet
            if H[idN] != 'collapsed': 
    
                # Check indices of all available patterns in that neighboring cell
                available = [i for i, b in enumerate(W[idN]) if b]
    
                # Among these indices, select indices of patterns that can be adjacent to the collapsed cell at this location
                intersection = A[idP][dir] & set(available) 
    
                # If the neighboring cell contains indices of patterns that can be adjacent to the collapsed cell
                if intersection:
    
                    # Remove indices of all other patterns that cannot be adjacent to the collapsed cell
                    W[idN] = [True if i in list(intersection) else False for i in xrange(npat)]
    
    
                    ### Update entropy of that neighboring cell accordingly (less patterns = lower entropy)
    
                    # If only 1 pattern available left, no need to compute entropy because entropy is necessarily 0
                    if len(intersection) == 1: 
                        H[idN] = '0' # Putting a str at this location in 'H' (array of entropies) so that it doesn't return 0 (float) when looking for minimum entropy (min(H)) at next iteration
    
    
                    # If more than 1 pattern is left --> compute/update entropy + add noise (to prevent cells from sharing the same minimum entropy value)
                    else:
                        lfreqs = [b * f for b, f in izip(W[idN], freqs) if b] 
                        ent = math.log(sum(lfreqs)) - sum(map(lambda x: x * math.log(x), lfreqs)) / sum(lfreqs)
                        H[idN] = ent + random(.001)
    
    
                # If no index of adjacent pattern in the list of pattern indices of the neighboring cell
                # --> mark cell as a 'contradiction'
                else:
                    H[idN] = 'CONT'
    
    
    
        # Draw output
    
        ''' dxd grid of cells (squares) filled with their corresponding color.      
            That color is the first (top-left) color of the pattern assigned to that cell '''
    
        for i, c in enumerate(Output):
            x, y = i%d, i/d
            fill(c)
            rect(x * xs, y * ys, xs, ys)
    
            # Displaying corresponding entropy value
            fill(0)
            text(H[i], x * xs + xs/2 - 12, y * ys + ys/2)
    

    The problem

    Example of a 20x20 output:

    [image: examples of 20x20 outputs]

    Both the pattern distribution and the adjacency constraints seem to be respected (roughly the same amount of blue, green, yellow and brown colors as in the input, and somewhat the same patterns: horizontal ground, green stems). However, these patterns:

    • are often disconnected
    • are often incomplete (missing the "heads" made of 4 yellow petals)
    • run into far too many contradictory states (grey cells marked as "CONT")

    Regarding that last point, I should clarify that contradictory states are normal but should happen only rarely (as mentioned on page 6 of this paper and in this article).

    This makes me think that something must be wrong with my implementation: either I implemented one of these steps incorrectly, or I'm missing a key element of the logic.

    In addition, any answer based on the provided script (using Processing or not) is welcome.

    Useful additional resources:

    This detailed article by Stephen Sherratt and this explanatory paper by Karth & Smith. Also, for comparison, I would suggest checking out this Python implementation.

    Note: I did my best to make this question as clear as possible (comprehensive explanation with GIFs and illustrations, fully annotated code with useful links and resources), but if for some reason you decide to vote it down, please leave a brief comment to explain why.

    2 Answers

    Answer 1  ·  16  ·  solub  ·  5 years ago

    The hypothesis suggested by @mbrig and @Leon, that the propagation step iterates over a whole stack of cells (instead of being limited to a set of 4 direct neighbors), was correct. The following is an attempt to provide further details while answering my own question.

    More precisely, during propagation:

    • the index of a specific cell is in turn replaced by the indices of its previously updated neighbors
    • this cascading process is triggered every time a cell is collapsed
    • and it lasts as long as the adjacent patterns of a specific cell are still available in one of its neighboring cells

    In other words, the propagation is recursive: it updates not only the 4 direct neighbors of the collapsed cell, but also the neighbors of those neighbors, and so on, for as long as adjacencies are possible.

    Detailed algorithm

    Once a cell is collapsed, its index is put in a stack. That stack is later meant to temporarily store the indices of neighboring cells:

    stack = set([emin]) #emin = index of cell with minimum entropy that has been collapsed
    

    The propagation will last as long as that stack is filled with indices:

    while stack:
    

    The first thing we do is pop() the last index contained in the stack (the only one for now) and get the indices of its 4 neighboring cells (E, W, N, S). We have to keep them within bounds and make sure they wrap around:

    while stack:
        idC = stack.pop() # index of current cell
        for dir, t in enumerate(mat):
            x = (idC%w + t[0])%w
            y = (idC/w + t[1])%h
            idN = x + y * w  # index of neighboring cell
    

    Before going any further, we make sure the neighboring cell is not collapsed yet (we don't want to update a cell that has only 1 pattern available):

            if H[idN] != 'c': 
    

    Then we check all the patterns that could be placed at that location. For example, if the neighboring cell is on the left of the current cell, we look at all the patterns that can be placed on the left of each pattern contained in the current cell:

                possible = set([n for idP in W[idC] for n in A[idP][dir]])
    

    We also look at the patterns that are available in that neighboring cell:

                available = W[idN]
    

    Now we make sure that the neighboring cell really needs to be updated. If all its available patterns are already in the list of all the possible patterns, there is no need to update it (the algorithm skips this neighbor and goes on to the next one):

                if not available.issubset(possible):
    

    However, if it is not a subset of the possible list, we take the intersection of the two sets (i.e. all the patterns that can be placed at that location and that, "luckily", are also available there):

                    intersection = possible & available
    

    If they don't intersect (patterns that could have been placed there but are not available), it means we ran into a "contradiction". We have to stop the whole WFC algorithm:

                    if not intersection:
                        print 'contradiction'
                        noLoop()
    

    If, on the contrary, they do intersect, we update the neighboring cell with that refined list of pattern indices:

                    W[idN] = intersection
    

    Because that neighboring cell has been updated, its entropy must be updated as well:

                    lfreqs = [freqs[i] for i in W[idN]]
                    H[idN] = (log(sum(lfreqs)) - sum(map(lambda x: x * log(x), lfreqs)) / sum(lfreqs)) - random(.001)
    

    Finally, and most importantly, we add the index of that neighboring cell to the stack so it becomes the next current cell in turn (the one whose neighbors will be updated during the next while loop iteration):

                    stack.add(idN)
    

    from collections import Counter
    from itertools import chain
    from random import choice
    
    w, h = 40, 25
    N = 3
    
    def setup():
        size(w*20, h*20, P2D)
        background('#FFFFFF')
        frameRate(1000)
        noStroke()
    
        global W, A, H, patterns, freqs, npat, mat, xs, ys
    
        img = loadImage('Flowers.png') # path to the input image
        iw, ih = img.width, img.height # dimensions of the input image
        xs, ys = width//w, height//h # dimensions of the output cells (squares)
        kernel = [[i + n*iw for i in xrange(N)] for n in xrange(N)] # pixel indices of a NxN patch
        mat = ((-1, 0), (1, 0), (0, -1), (0, 1)) # (x, y) offsets of the 4 neighbors: left, right, up, down
        all = [] # list of all the NxN patterns found in the input
    
        for y in xrange(ih):
            for x in xrange(iw):
                cmat = [[img.pixels[((x+n)%iw)+(((a[0]+iw*y)/iw)%ih)*iw] for n in a] for a in kernel]
                for r in xrange(4):
                    cmat = zip(*cmat[::-1])
                    all.append(cmat)
                    all.append(cmat[::-1])
                    all.append([a[::-1] for a in cmat])
    
        all = [tuple(chain.from_iterable(p)) for p in all] 
        c = Counter(all)
        patterns = c.keys()
        freqs = c.values()
        npat = len(freqs) 
    
        W = [set(range(npat)) for i in xrange(w*h)] # Wave: each cell starts with every pattern index possible
        A = [[set() for dir in xrange(len(mat))] for i in xrange(npat)] # adjacency rules, per pattern and direction
        H = [100 for i in xrange(w*h)] # entropies (same arbitrary high value for every cell at start)
    
        for i1 in xrange(npat):
            for i2 in xrange(npat):
                if [n for i, n in enumerate(patterns[i1]) if i%N!=(N-1)] == [n for i, n in enumerate(patterns[i2]) if i%N!=0]:
                    A[i1][0].add(i2)
                    A[i2][1].add(i1)
                if patterns[i1][:(N*N)-N] == patterns[i2][N:]:
                    A[i1][2].add(i2)
                    A[i2][3].add(i1)
    
    
    def draw():    
        global H, W
    
        emin = int(random(w*h)) if frameCount <= 1 else H.index(min(H)) 
    
        if H[emin] == 'c': 
            print 'finished'
            noLoop()
            return

        id = choice([idP for idP in W[emin] for i in xrange(freqs[idP])]) # one-line weighted choice
        W[emin] = [id] # collapse: keep only the chosen pattern index
        H[emin] = 'c' # mark the cell as collapsed
    
        stack = set([emin])
        while stack:
            idC = stack.pop() 
            for dir, t in enumerate(mat):
                x = (idC%w + t[0])%w
                y = (idC/w + t[1])%h
                idN = x + y * w 
                if H[idN] != 'c': 
                    possible = set([n for idP in W[idC] for n in A[idP][dir]])
                    if not W[idN].issubset(possible):
                        intersection = possible & W[idN] 
                        if not intersection:
                            print 'contradiction'
                            noLoop()
                            return
    
                        W[idN] = intersection
                        lfreqs = [freqs[i] for i in W[idN]]
                        H[idN] = (log(sum(lfreqs)) - sum(map(lambda x: x * log(x), lfreqs)) / sum(lfreqs)) - random(.001)
                        stack.add(idN)
    
        fill(patterns[id][0])
        rect((emin%w) * xs, (emin/w) * ys, xs, ys)
    

    [image: example outputs produced by the corrected script]

    Overall improvements

    In addition to these fixes, I also did some minor code optimizations to speed up both the observation and propagation steps, and shortened the weighted-choice computation.

    • The "Wave" is now composed of Python sets of indices whose size decreases as cells are "collapsed" (replacing large fixed-size lists of booleans).

    • Entropies are stored in a defaultdict whose keys are progressively deleted (see the sketch after this list).

    • The starting entropy value is replaced by a random integer (the first entropy calculation is not needed since the level of uncertainty is equally high for every cell at start).

    • Cells are displayed once (avoiding storing them in an array and re-drawing them at each frame).

    • The weighted choice is now a one-liner (avoiding several dispensable lines of list comprehension).
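
    As an example of the two entropy-related points above, here is my own sketch of how they could look (an assumption for illustration, not the exact code that was used):

    from collections import defaultdict
    from random import random

    w, h = 40, 25
    H = defaultdict(float)
    for i in xrange(w * h):
        H[i] = 100 + random()      # arbitrary high starting value plus a little noise

    emin = min(H, key=H.get)       # observation: the cell with minimum entropy
    del H[emin]                    # a collapsed cell simply disappears from H
    print len(H)                   # prints: 999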

    Answer 2  ·  5  ·  mbrig  ·  5 years ago

    While looking at the live demo linked in one of your examples, and based on a quick review of the original algorithm code, I believe your error lies in the "propagation" step.

    The actual C# implementation of the original algorithm is quite complicated and I don't fully understand it, but the key points seem to be the creation of a "propagator" object here, and the Propagate function itself, here.